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(54) DNA POLYMERASE-RELATED FACTORS 

(57) The present invention relates to a thermostable 
DNA polymerase-associated factor capable of enhanc- 
ing DNA synthesizing-activity of a DNA polymerase; a 
thermostable DNA polymerase-associated factor pos- 
sessing an activity of binding to a DNA polymerase and 
a method for producing the same; a gene encoding the 
DNA polymerase-associated factor; a method of DNA 
synthesis by using a DNA polymerase in the presence 
of the DNA polymerase-associated factor; and a kit 
comprising the DNA polymerase-associated factor. 
According to the present invention, there can be pro- 
vided in vitro DNA synthesis and a DNA amplification 
system which are more excellent than conventional 
techniques by utilizing the DNA polymerase-associated 
facta of the present invention. 
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Description 

TECHNICAL FIELD 

5 [0001 ] The present invention relates a DNA polymerase-associated factor. More specifically, the present invention 
relates to a DNA polymerase-associated factor which is useful for a reagent for genetic engineering and a method for 
producing the same, and further a gene encoding thereof, and the like. 

BACKGROUND ART 

10 

[0002] DNA polymerases are useful enzymes for reagents for genetic engineering, and the DNA polymerases are 
widely used for nucleotide sequencing of DNA, DNA labeling, site-directed mutagenesis, and the like. Also, thermosta- 
ble DNA polymerases have recently been remarked with the development of the polymerase chain reaction (PCR) 
method, and various DNA polymerases suitable for the PCR method have been developed and commercialized. 

is [0003] Presently known DNA polymerases can be roughly classified into four families according to amino acid 
sequence homologies, among which family A (pol I type enzymes) and family B (a type enzymes) account for the great 
majority. Although DNA polymerases belonging to each family generally possess mutually similar biochemical proper- 
ties, detailed comparison reveals that individual enzymes differ from each other in terms of substrate specificity, incor- 
poration efficiency of a substrate analog, primer extensibility and extension rate, mode of DNA synthesis, association 

20 of exonuclease activity, optimum reaction conditions of temperature, pH and the like, and sensitivity to inhibitors. There- 
fore, those possessing most appropriate properties for the applications have been selected among all available DNA 
polymerases, and the selected DNA polymerase has been used. 

[0004] A hyperthermophilic archaebacterium Pyrococcus furiosus has produced a DNA polymerase belonging to 
a type, and its gene has already been isolated [Nucleic Adds Research, 21 , 259-265 (1 993)]. 
25 [0005] As DNA polymerases, in addition to ones expressing their functions with only one kind of an enzyme protein, 
such as the pol I type enzyme or the a type enzyme, there have been known oligomer enzymes constituted by a large 
number of subunit proteins. In addition to the protein serving as a DNA polymerase, there have also been known some 
cases where protein molecules for regulating their functions coexist 

30 DISCLOSURE QF INVENTION 

[0006] An object of the present invention is to provide a thermostable DNA polymerase-associated factor capable 
of enhancing DNA synthesizing-activity of a DNA polymerase, and a thermostable DNA polymerase-associated factor 
possessing an activity of binding to a DNA polymerase. 
35 [0007] Another object of the present invention is to provide a gene for the DNA polymerase-associated factor of the 
present invention. 

[0008] Still another object of the present invention is to provide a method for producing the DNA polymerase-asso- 
ciated factor of the present invention. 

[0009] Still another object of the present invention is to provide a method of DNA synthesis by using a DNA 
40 polymerase in the presence of the DNA polymerase-associated factor of the present invention. 

[0010] Still another object of the present invention is to provide a kit comprising the DNA polymerase-associated 
factor of the present invention. 

[0011] According to the present invention, there can be provided in vitro DNA synthesis and a DNA amplification 
system which are more excellent than conventional techniques by utilizing the DNA polymerase-associated factor of the 
45 present invention. 

[001 2] Recently, a novel DNA polymerase having completely no structural homology to conventionally known DNA 
polymerases has been found by the present inventors from hyperthermophilic archaebacterium Pyrococcus furiosus 
(WO 97/24444 Pamphlet). In this DNA polymerase, two kinds of novel proteins form a complex and exhibit a DNA 
polymerase activity. In addition, the enzyme exhibits a potent 3* -> 5' exonuclease activity and excellent primer exten- 

so sion activity. For example, when the enzyme is used for PCR, a DNA fragment of the size of about-20 kb can be ampli- 
fied. In this novel DNA polymerase derived from Pyrococcus furiosus, although at least two kinds of proteins are 
essential constituents in the enzyme activity, it has not been elucidated whether or not a constituent protein of the 
enzyme beside the above exists, or whether or not a factor having an influence on the activity of the enzyme exists. 
[0013] As a result of intensive studies, the present inventors have succeeded in isolating a protein binding to the 

55 novel DNA polymerase derived from Pyrococcus furiosus. Further, they have found that the production of the protein 
by genetic engineering is made possible by cloning the gene, and moreover that a DNA synthesizing-activity of a DNA 
polymerase is enhanced. 

[0014] In sum, the present invention relates to: 
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[1] a thermostable DNA polymerase-associated factor capable of enhancing DNA synthesizing-activity of a DNA 
polymerase; 

[2] the DNA polymerase-associated factor according to item [1] above, further possessing an activity of binding to 
a DNA polymerase; 

5 [3] the DNA polymerase-associated factor according to item [2] above, which possesses an activity of binding to a 
DNA polymerase comprising a DNA polymerase-constituting protein having the amino acid sequence as shown in 
SEQ ID NO: 5 or 6 in Sequence Listing; 

[4] the DNA polymerase-associated factor according to any one of items [1] to [3] above, comprising at least one of 
amino acid sequences selected from the group consisting of SEQ ID NOs: 1, 3, 19, 27, 34, 64, 70 and 80 in 
10 Sequence Listing, or an amino acid sequence resulting from substitution, deletion, addition or insertion of one or 
more amino acids in at least one of the amino acid sequences; 

[5] a gene encoding a DNA polymerase-associated factor, wherein the factor comprises at least one of amino acid 
sequences selected from the group consisting of SEQ ID NOs: 1,3,19, 27, 34, 64, 70 and 80 in Sequence Listing, 
or an amino acid sequence resulting from substitution, deletion, addition or insertion of one or more amino acids in 
is at least one of amino acid sequences, and possesses an activity of enhancing DNA synthesizing-activity of a DNA 
polymerase; 

[6] the gene according to item [5] above, comprising a nucleotide sequence selected from the group consisting of 
SEQ ID NOs: 2, 4, 18, 26. 33, 63, 69 and 79, or a nucleotide sequence resulting from substitution, deletion, addition 
or insertion of one or more bases in the nucleotide sequence; 

20 [7] a gene capable of hybridizing to the gene of item [5] or [6] above, and encoding a DNA polymerase-associated 
factor possessing an activity of enhancing DNA synthesizing-activity of a DNA polymerase; 
[8| a method for producing a DNA polymerase-associated factor, characterized in that the method comprises cul- 
turing a transformant harboring the gene of any one of items [5] to [7] above, and collecting a thermostable DNA 
polymerase-associated factor capable of enhancing DNA synthesizing-activity of a DNA polymerase from the cul- 

25 tured medium; 

[9] a method of DNA synthesis by using a DNA polymerase, characterized in that DNA is synthesized in the pres- 
ence of the DNA polymerase-associated facta of any one of items [1] to [4] above; 

[10] the method of DNA synthesis according to item [9] above, wherein DNA is synthesized in the presence of two 
or more kinds of DNA polymerase-associated factors; 
30 f 1 1] the method of DNA synthesis according to item [10] above, wherein DNA is synthesized in the presence of F7, 
PFU-RFC and PFU-RFCLS as a DNA polymerase-associated factor; 

[12] the method of DNA synthesis according to any one of items [9] to [1 1] above, wherein the DNA polymerase is 
a thermostable DNA polymerase; 

[13] the method of DNA synthesis according to item [12] above, wherein the synthesis is carried out by PCR 
35 method; 

[14] a kit usable for in vitro DNA synthesis, comprising the DNA polymerase-associated factor of any one of items 
[1] to [4] above and a DNA polymerase; 

[15] the kit according to item [14] above, further comprising a reagent required for DNA synthesis; 
[16] the kit according to item [14] or [15] above, comprising two or more kinds of DNA polymerase-associated fac- 
40 tors; 

[17] the kit according to item [16] above, comprising F7, PFU-RFC and PFU-RFCLS as a DNA polymerase-asso- 
ciated factor; and 

[18] the kit according to any one of items [14] to [17] above, comprising a thermostable DNA polymerase as a DNA 
polymerase. 

45 

PRIEF PESCRIPTIQN QF PRAWINGS 
[0015] 

so Figure 1 is a drawing showing SDS-PAGE of 7 kinds of proteins (F1 , F2, F3, F4, F5, F6 and FT) isolated by an anti- 
Pf u polymerase C antibody column. The molecular weights on SDS-PAGE are about 55 kDa, about 24 kDa, about 
37 kDa. about 19.5 kDa, about 27 kDa, about 64 kDa and about 33 kDa, in a sequential order of F1 to F7. 
Figure 2 is a restriction endonuclease map of a DNA insert of the plasrrrid pF1-4-10 carrying a gene encoding the 
F1 protein. 

55 Figure 3 is a graph showing a 5' -+ 3' exonuclease activity of the F1 protein. 
Figure 4 is a graph showing a 3' -> 5' exonuclease activity of the F1 protein. 

Figure 5 is a restriction endonuclease map of a DNA insert of the plasmid pF21 72Nh carrying a gene encoding the 
F2 protein. 
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Figure 6 is a restriction endonuclease map of a DNA insert of the plasmid pF7-1-8 carrying a gene encoding the 
F7 protein. 

Figure 7 is an autoradiogram showing a primer extension activity of the DNA polymerase when the F7 protein is 
added. 

5 Figure 8 is an autoradiogram showing a primer extension activity for the higher molecular primer extension reaction 
product of the DNA polymerase, when the F7 protein is added. 

Figure 9 is a restriction endonuclease map of a DNA insert of the plasmid pRFS254NdB carrying a gene encoding 
the PFU-RFC protein. 

Figure 10 shows the analytical results of SDS-PAGE of the protein (F7) isolated by an anti-Pfu DNA polymerase 
w antibody column. The molecular weight of F7 on SDS-PAGE is deduced to be about 33 kDa. 

Figure 1 1 shows the analytical results of DNA polymerase activity of the eluate obtained by subjecting to gel filtra- 
tion Pfu DNA polymerase and a mixture of Pf u DNA polymerase and F7. 

Figure 12 is a restriction endonuclease map of a DNA insert of the plasmid pRFLSNh carrying a gene encoding the 
PFU-RFCLS protein. 

75 Figure 13 is a restriction endonuclease map around the gene encoding the F5 protein on genomic DNA of Pyo- 
coccus furiosus. 

Figure 14 shows analytical results of SDS-PAGE of 3 kinds of proteins (PFU-RFCLS, PFU-RFC, F7) isolated by an 
anti-PFU-RFC antibody column. 

Figure 15 is a graph showing DNA polymerase activity when F7 or RFC-N complex is added. 
20 Figure 16 is a restriction endonuclease map of a DNA insert of the plasmid pRFCIO carrying genes encoding PFU- 
RFCLS and PFU-RFC. 

Figure 17 is a graph showing DNA polymerase activity, when F7, or F7 and rRFC-M complex are added. 
BEST MODE FOR CARRYING OUT THE INVENTION 

25 

1 . DNA Pdymerase-Associated Factor of the Present Invention 

[001 6] In the present specification, the term "DNA polymerase-associated factor" means a factor which has effects 
on a function of a DNA polymerase by coexisting with the DNA polymerase. Concretely, the DNA poiymerase-associ- 
30 ated factors include a factor possessing an action of enhancing the DNA synthesizing-activity of a DNA polymerase, a 
factor possessing an activity of binding to a DNA polymerase, and further one possessing both activities, and the like. 
In addition, the DNA polymerase-associated factor of the present invention is a thermostable protein, which is, for 
instance, stable against heat treatment at 80°C for 1 5 minutes. Therefore, the factor can be used for DNA synthesizing- 
reaction under high-temperature conditions using a thermostable DNA polymerase. 

35 

(a) DNA Polymerase-Associated Factor Capable of Enhancing DNA Syrrthesizing-Activrty of DNA Polymerase 

[001 7] The DNA polymerase-associated factor capable of enhancing DNA synthesizing-activity of a DNA polymer- 
ase is not particularly limited, as long as the factor is capable of enhancing DNA synthesizing-activity of a DNA polymer- 

40 ase. For instance, the factor includes proteins comprising an entire or partial sequence of amino acid sequence as 
shown in at least one sequence selected from the group consisting of SEQ ID NOs: 1 , 3, 19, 27, 34, 64, 70 and 80 in 
Sequence Usting; or functional equivalents thereof comprising an amino acid sequence resulting from substitution, 
deletion, addition or insertion of one or more amino acids in at least one of the amino acid sequences, and the equiva- 
lent possessing an activity of enhancing DNA synthesizing-activity of a DNA polymerase. In the present specification, 

45 the term "one or more" refers to a number of one or several or more. In addition, the term "functional equivalent" refers 
to ones which are substantially equivalent in their functions and activities even though they are structurally different, and 
the functional equivalents are also encompassed in the DNA polymerase-associated factor of the present invention. 
[0018] The DNA polymerase of which activity is enhanced by the DNA polymerase-associated factor of the present 
invention is not particularly limited. Examples thereof include thermostable DNA polymerases, in particular DNA 

so polymerases derived from hyperthermophilic archaebacterium. Concretely, there can be cited DNA polymerases 
derived from Pyrococcus furiosus (Pfu polymerase C, and the like mentioned below). As described below, the Pfu 
polymerase C is an enzyme comprising a DNA polymerase-constituting protein having the amino acid sequences as 
shown in SEQ ID NO: 5 and SEQ ID NO: 6 in Sequence Usting. 

[0019] In addition, the DNA polymerase-associated factor of the present invention may be one enhancing only an 
55 activity of a particular DNA polymerase, and it is preferably one enhancing its activities against a plural kinds of DNA 
polymerase from different origins. 

[0020] The method for determination of an activity of enhancing DNA synthesizing-activity of a DNA polymerase is 
not particularly limited, as long as it is one usually employed in the determination of DNA synthesizing-activity of a DNA 
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polymerase. The activity of enhancing DNA synthesizing-activity can be, for instance, determined by adding the factor 
when measuring an incorporation activity of the labeled nucleotide into a novel synthesized DNA strand; and comparing 
the incorporation activity with an activity when the factor is not added. In addition, there can be cited a method for con- 
firmation from the chain length of a novel synthetic DNA strand per unit time or from the amount of PCR amplified prod- 
5 uct per unit time. As the method for determination of the DNA synthesizing-activity, there can be cited a method 
described in DNA Polymerase from Escherichia co// ( published by Harpar and Row, edited by D.R. Davis, 263-276 
(authored by C.C. Richardson), and the like. 

[0021] Further, in the DNA polymerase-associated factor of the present invention, by a combination of a plurality of 
the DNA polymerase-associated factors, there can be exhibited an even higher DNA polymerase activity in the ooexist- 
w ent DNA polymerases when compared with that of the single use. 

(b) DNA Polymerase-Associated Factor Possessing Activity of Binding to DNA Polymerase 

[0022] The D N A polymerase-associated factor possessing an activity of binding to a DNA polymerase is not partic- 
15 ularly limited, as long as it possesses an activity of binding to a DNA polymerase. Incidentally, the DNA polymerase- 
associated factor possessing an activity of binding to a DNA polymerase in the present specification encompasses 
other substances, for instance, ones having an activity of indirectly binding to a DNA polymerase via other DNA 
polymerase-associated factors, as well as ones having an activity of directly binding to a DNA polymerase. Examples 
thereof include proteins comprising an entire or partial sequence of amino acid sequence as shown in at least one 
20 sequence selected from the group consisting of SEQ ID NOs: 1 , 3, 19. 27, 34, 64, 70 and 80 in Sequence Listing; or 
functional equivalents thereof comprising an amino acid sequence resulting from substitution, deletion, addition or 
insertion of one or more amino acids in at least one of the amino acid sequences, and the equivalent possessing an 
activity of binding to a DNA polymerase. In the present specification, the term "one or more" refers to a number of one 
or several or more. 

25 [0023] The DNA polymerase binding to the DNA polymerase-associated factor of the present invention, which is 
not particularly limited, includes, for instance, a thermostable DNA polymerase, in particular DNA polymerases derived 
from hyperthermophilic archaebacterium. Concretely, there can be cited DNA polymerases derived from Pyrococcus 
furiosus (Pfu polymerase C, and the like). One or both of the DNA polymerase-constituting proteins having the amino 
acid sequences as shown in SEQ ID NO: 5 and SEQ ID NO: 6 in Sequence Listing are bound to Pfu polymerase C. 

30 [0024] In addition, the DNA polymerase-associated factor of the present invention may be one binding to a partic- 
ular DNA polymerase, and it is preferably one binding to a plural kinds of DNA polymerase from different origins. 
[0025] The method for determination of the binding to a DNA polymerase includes a method comprising mixing the 
factor with a DNA polymerase, and examining a change in the molecular weight by native gel electrophoresis, gel filtra- 
tion, and the like; a method for examining the adsorption of the factor to a carrier immobilized to a DNA polymerase, 

35 and the like. 

[0026] In addition, the DNA polymerase-associated factor comprising the amino acid sequence as shown in SEQ 
ID NO: 19 in Sequence Listing possesses an exonuclease activity. Therefore, it is considered that the DNA polymerase- 
associated factor comprising the amino acid sequence as shown in SEQ ID NO: 19 is a protein having a function asso- 
ciated with the action of a DNA polymerase in DNA replication, DNA repair, and the like. Further, as the functional 

40 equivalents of the DNA polymerase-associated factor, proteins comprising a partial sequence of the amino acid 
sequence as shown in SEQ ID NO: 1 9 in Sequence Listing, or an amino acid sequence resulting from substitution, dele- 
tion, addition or insertion of one or more amino acids in at least one of the sequences, wherein the proteins possess an 
activity of binding to a DNA polymerase, and further similarly possess an exonuclease activity are encompassed in the 
present invention as the DNA polymerase-associated factor. In the present specification, the term "one or more" refers 

45 to a number of one or several or more. 

[0027] Incidentally, in the explanation of the DNA polymerase-associated factor of the present invention, the factor 
is identified as a protein comprising an entire or partial sequence of each of the amino acid sequences as shown in par- 
ticular SEQ ID NO in Sequence Listing, and the term "protein comprising" as used herein encompasses proteins as 
described below, which are also encompassed in the present invention. Namely, when a protein is produced by genetic 

50 engineering techniques, it is often expressed as a fusion protein. For instance, in order to increase an expression level 
of the desired protein, the protein is expressed by adding a N-terminal peptide chain derived from other proteins to the 
N-terminus, or expressed by adding an appropriate peptide chain at N-terminus or C-terminus of the desired protein, 
and a carrier having affinity with each of the peptide chain is used, whereby facilitating the purification of the desired 
protein. In the present invention, the fusion proteins mentioned above are also encompassed. 

55 
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2. Genes Encoding DNA Polymerase-Associated Factor of the Present Invention 

(a) Properties of Genes Encoding DNA Polymerase-Associated Factor of the Present Invention 

5 [0028] The genes encoding the DNA polymerase-associated factor of the present invention are those encoding the 
DNA polymerase-associated factor of the present invention mentioned above, which refers to DNA or RNA. Concretely, 
the gene includes a gene encoding a DNA polymerase-associated factor, wherein the factor comprises an entire or par- 
tial sequence of amino acid sequence as shown in at least one sequence selected from the group consisting of SEQ 
ID NOs: 1, 3, 19, 27, 34, 64, 70 and 80 in Sequence Listing, or an amino acid sequence resulting from substitution, 

10 deletion, addition or insertion of one or more amino acids in at least one of these sequences, and the factor possesses 
an activity of enhancing DNA synthesizing-activrty of a DNA polymerase, or an activity of binding to a DNA polymerase. 
Concrete examples of such genes include genes encoding a DNA polymerase-associated factor, comprising an entire 
or partial sequence of nucleotide sequence as shown in at least one sequence selected from the group consisting of 
SEQ ID NOs: 2, 4, 18, 26, 33, 63, 69 and 79, or a nucleotide sequence resulting from substitution, deletion, addition or 

75 insertion of one or more bases in these sequences, wherein the factor possesses an activity of enhancing DNA synthe- 
sizing-activity of a DNA polymerase, or an activity of binding to a DNA polymerase. In the present specification, the term 
"one or more** refers to a number of one or several or more. In the present invention, there can be further cited a gene 
capable of hybridizing to a DNA of the gene of the present invention, and possessing an activity of enhancing DNA syn- 
thesizing-activity, or an activity of binding to a DNA polymerase. 

20 [0029] The term "gene capable of hybridizing (to a gene)" described in the present specification refers to a gene 
comprising a DNA capable of hybridizing to a DNA of a gene, which is a gene having a nucleotide sequence resembling 
to the gene. With regard to the gene having a nucleotide sequence resembling to a gene, there is a high possibility of 
having resemblance to an amino acid sequence of a protein encoded thereby, and additionally having resemblance to 
a function of the protein. The homology of the nucleotide sequence of the gene can be examined by whether or not a 

25 hybrid is formed (the genes being hybridized) with DNAs of both genes or a partial portion thereof under stringent con- 
ditions. By utilizing hybridization, a gene encoding a protein having similar functions to a protein encoding the gene can 
be obtained. In other words, the other genes of the present invention having homologous nucleotide sequences to a 
gene of the present invention can be obtained by carrying out hybridization by a known method using a DNA of the gene 
obtained in the present invention, or a partial portion thereof, as a probe. The hybridization can be carried out for 

30 instance, by a method described in Molecular Cloning: A Laboratory Manual, 2nd Ed., published by Cold Spring Harbor 
Laboratory in 1989. edited by T. Maniatis et al., or the like. 

[0030] Here, the term "the stringent conditions" refers to conditions in which non-specific hybridization does not 
take place. Concretely, for instance, there are the following conditions. In other words, a DNA-immobilized membrane 
is incubated at 50°C for 12 to 20 hours together with a labeled DNA probe in 6 x SSC (wherein 1 x SSC shows 0.15 M 

35 NaCI. 0.015 M sodium citrate. pH 7.0) containing 0.5% SDS. 0.1% bovine serum albumin (BSA), 0.1% polyvinyl pyrro- 
lidone. 0.1% Ficol 400, and 0.01% denatured salmon sperm DNA. After termination of the incubation, the membrane is 
washed, initiating under the conditions of 37°C in 2 x SSC containing 0.5% SDS, the SSC concentration being made 
variable up to a range of 0.1 x SDS, and the temperature being variable up to a range of 50°C, until a signal ascribed 
to an immobilized labeled DNA probe can be distinguished from the background. 

40 [0031] In addition, instead of hybridization, there can be utilized a method for gene amplification using a partial 
sequence of the nucleotide sequence of the gene of the present invention as a primer. For instance, PCR method can 
be utilized. The PCR conditions can be appropriately set by sequences of primer DNAs or a template DNA. Whether or 
not the gene obtained as described above encodes a protein having the desired function can be examined by confirm- 
ing the activity of the resulting protein by expressing a protein encoded by the gene using an appropriate host and an 

45 expression system. 

[0032] In addition, the method for artificially preparing an amino acid sequence or nucleotide sequence having sub- 
stitution, deletion, addition, or insertion of one or more in the amino acid sequence or nucleotide sequence in the 
present invention includes various genetic engineering manipulations described in Molecular Cloning: A Laboratory 
Manual, 2nd Ed., published by Cold Spring Harbor Laboratory in 1989, edited by T. Maniatis et al. t or the like. Concrete 

so examples thereof include genetic engineering techniques such as methods for sitedirected mutagenesis and cassette 
mutation methods. By the method for site-directed mutagenesis, an amino acid sequence or nucleotide sequence hav- 
ing one or more substitution, deletion, addition or insertion can be prepared. By the cassette mutation method, there 
can be prepared an amino acid sequence or nucleotide sequence having a larger region of deletion, addition or inser- 
tion as compared with the sequence obtained by the method for site-directed mutagenesis. These modified products 

55 described above are also encompassed in the present invention as long as they are functionally equivalent Further, in 
the production of a protein by genetic engineering techniques, in a case where a codon used on a naturally occurring 
gene encoding the desired protein is used at a low frequency, the expression level of the protein may be low. In such a 
case, the codon is artificially converted to one frequently used in the host without changing the encoded amino acid 
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sequence, whereby the desired protein is highly expressed (for instance, Japanese Examined Patent Publication No. 
Hei 7-102146). 

(b) Cloning of Gene Encoding DNA Polymerase-Associated Factor of the Present Invention 

5 

[0033] Detailed descriptions on the analysis of the resulting clones, the physicochemical properties of the expres- 
sion product DNA polymerase -associated factor, the elucidation of the functions, and the like will be given hereinbelow. 
[0034] As described above, the DNA polymerase-associated factor of the present invention possesses an action of 
enhancing DNA synthesizing-activity of a DNA polymerase, or a characteristic of binding the factor to a DNA polymer- 
jo ase. Therefore, the factor can be obtained by using these actions as indices. 

[0035] The DNA polymerase utilizable in the obtainment of the DNA polymerase-associated factor of the present 
invention is not particularly limited, and an example thereof includes a Pyrococcus foriasus-producing DNA polymer- 
ase. As the Pyrococcus furiosus -producing DNA polymerase, for instance, there can be used an enzyme comprising a 
DNA polymerase-constituting protein comprising the amino acid sequence as shown in SEQ ID NO: 5 and/or SEQ ID 
is NO: 6 in Sequence Listing, derived from Pyrococcus furiosus DSM3638. 

[0036] Incidentally, in the present specification, this enzyme is described as Pfu polymerase C, in order to distin- 
guish with a type DNA polymerase [Pfu DNA polymerase, Nucleic Acids Research, 21, 259-265 (1993)], which has 
been also found from Pyrococcus furiosus. The gene encoding the enzyme is carried by plasmid pFU1001 . In addition, 
a transformant, Escherichia coti JM109 transformed with the plasmid, is named and identified as Escherichia coii 
20 JM1094JFU1001, and deposited under the accession number of FERM BP-5579 with the National Institute of Bio- 
science and Human-Technology, Agency of Industrial Science and Technology, Ministry of International Trade and 
Industry, of which the address is 1-3, Higashi 1-chome, Tsukuba-shi, Ibaraki-ken (Zipcode 305-8566), Japan, since 
August 1 1 , 1995 (date of original deposit) under the Budapest Treaty. Therefore. Pfu polymerase C can be obtained by 
culturing the transformant and purifying from the resulting cultured medium. Incidentally, Pfu polymerase C is an 
25 enzyme comprising a DNA polymerase-constituting protein having the amino acid sequence as shown in SEQ ID NO: 
5 and/or SEQ ID NO: 6 in Sequence Listing. 

[0037] Pfu polymerase C is an enzyme possessing the following properties: 

(A) exhibiting a higher activity when the polymerase activity is determined by using as a substrate a complex resutt- 
30 ing from annealing of a primer to a single stranded template DNA, as compared to the case where an activated 

DNA is used as a substrate: 

(B) possessing a 3'->5' exonuclease activity; 

(C) being capable of amplifying a DNA fragment of about 20 kbp without adding other enzymes, in the case where 
polymerase chain reaction (PCR) is carried out with X-DNA as a template under the following conditions: PCR con- 

35 ditions: 

a) a composition of reaction mixture: comprising 10 mM Tris-HCI (pH 9.2), 3.5 mM MgQ 2 . 75 mM KCJ . 400 H M 
each of dATP, dCTP, dGTP and dTTP, 0.01% bovine serum albumin. 0.1% Triton X-100, 5.0 ng/50 jil X-DNA. 
10 pmole/50 til primer X1 (SEQ ID NO: 58 in Sequence Listing), primer X1 1 (SEQ ID NO: 59 in Sequence List- 

40 ing), and 3.7 units/50 nl DNA polymerase; 

b) reaction conditions: carrying out PCR for 30 cycles, wherein one cycle is 98°C, 10 seconds - 68°C, 10 min- 
utes; and 

(D) comprising two kinds of DNA polymerase-constituting proteins corresponding to about 90,000 daltons and 
45 about 1 40,000 daltons on SDS-PAGE, respectively. 

[0038] The method of obtaining the DNA polymerase-associated factor of the present invention is not particularly 
limited. For instance, the factor can be obtained by immobilizing a DNA polymerase, such as Pfu polymerase C, to an 
appropriate carrier, mixing the DNA polymerase-immobilized carrier with a sample containing the DNA polymerase- 

50 associated factor, removing the factor unbound to the carrier, and thereafter eluting the bound carrier. The immobiliza- 
tion of the DNA polymerase to the carrier can be carried out by a known method. Alternatively, an antibody against the 
DNA polymerase is prepared, and a DNA polymerase may be immobilized by utilizing the antibody-immobilized carrier. 
For instance, when an anti-Pf u polymerase C antibody is prepared, and the DNA polymerase-associated factor of the 
present invention is obtained by using the antibody from a sample derived from Pyrococcus furiosus, including, for 

55 instance, a cell disrupted solution of Pyrococcus furiosus, Pfu polymerase C in the sample binds to this antibody when 
the antibody-immobilized carrier as described above is used. Therefore, it is not necessary to add Pfu polymerase C 
aside from the sample, so that the DNA polymerase-associated factor can be readily purified. 
[0039] The sample used in the obtainment of the DNA polymerase-associated factor of the present invention is not 
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particularly limited. For instance, there can be used samples derived from microorganisms. Concretely, samples 
derived from Pyrococcus furiosus DSM 3638 can be used. The above strain can be made available from Deutsche 
Sammlung von Mikroorganismen und Zellkulturen GmbH. In the case of a cell disrupted solution obtained by culturing 
the above strain in an appropriate growth medium and preparing from the resulting cultured medium is applied to a col- 
5 umn packed with a carrier immobilized with an anti-Pfu polymerase C antibody, several kinds of proteins other than Pfu 
polymerase C are adsorbed to the column. The gene encoding these proteins can be cloned by the procedures exem- 
plified below. 

[0040] First the above proteins are isolated by a known method, and their N-terminal amino acid sequences are 
determined. In reference to the amino acid sequences, synthetic oligonucleotides to be used as primers or probes are 

10 prepared. Next. PCR is carried out with a genomic DNA of Pyrococcus furiosus as a template using this synthetic oli- 
gonucleotide as a primer, whereby a DNA fragment carrying the desired gene can be obtained. The conditions for PCR 
may be appropriately set. Alternatively, a DNA fragment carrying the desired gene can be obtained from a genomic 
DNA of Pyrococcus furiosus by carrying out hybridization using the above oligonucleotide as a probe. In this case, as 
the hybridization, there can be employed Southern hybridization using a genomic DNA of Pyrococcus furiosus obtained 

75 by digesting with an appropriate restriction enzyme, colony hybridization using a gene library of a genomic DNA of 
Pyrococcus furiosus, plaque hybridization, dot hybridization, and the like. 

[0041 ] When the DNA fragment as obtained above does not carry a full length of the desired gene, new primers are 
prepared in reference to the nucleotide sequence of the resulting DNA fragment, and PCR is further carried out, or 
hybridization is carried out using the resulting DNA fragment or its partial fragment as a probe, whereby a full length of 
20 the desired gene can be obtained. 

[0042] The manipulations for the PCR and hybridization are not particularly limited, and for instance, they can be 
carried out in reference to Molecular Cloning: A Laboratory Manual, 2nd Ed., published by Cold Spring Harbor Labora- 
tory in 1989, edited by T Maniatis et al. 

[0043] When the cell disrupted solution of the strain Pyrococcus furiosus DSM 3638 is mixed with the above carrier 

25 immobilized with the anti-Pfu polymerase C antibody, there are seven kinds of proteins adsorbed to the carrier as well 
as Pfu polymerase C. With respect to six kinds among them, in the present invention, their genes have been isolated 
by the above described manipulations. These proteins are named F1, F2, F3. F4, F5 and F7, respectively, which are 
the concrete examples of the DNA polymerase-associated factor of the present invention. The nucleotide sequences of 
an open reading frame of the gene encoding these proteins are shown in SEQ ID NOs: 1 8, 26, 79, 33, 69 and 2, respec- 

30 tively, in Sequence Listing. In addition, the amino acid sequences of each protein deduced from these nucleotide 
sequences are shown in SEQ ID NOs: 1 9, 27, 80, 34, 70 and 1 , respectively, in Sequence Listing. 
[0044] The cloned gene is introduced into an appropriate host, for instance, Escherichia coli, whereby allowing to 
express a protein encoded thereby. For instance, a transfbrmant of Escherichia coli JM109, into which a gene encoding 
F7 mentioned above is introduced, is named and identified as Escherichia coli JM1 09/|p>F7-HH-1 8, and deposited under 

35 the accession number of FERM BP-6338 with the National Institute of Bioscience and Human-Technology. Agency of 
Industrial Science and Technology. Ministry of International Trade and Industry, of which the address is 1-3, Higashi 1- 
chome, Tsukuba-shi, Ibaraki-ken (Zipcode 305-8566), Japan, since June 3, 1997 (date of original deposit) under the 
Budapest Treaty. F7 can be obtained by culturing the transformant, and recovering a desired product from the resulting 
culture, ft is elucidated in the present invention that the F7 as obtained above enhances activities of a type polymerase 

40 (Pfu DNA polymerase) derived from Pyrococcus furiosus and two kinds of DNA polymerases [J. BacterioL, 177, 2164- 
2177 (1995)] derived from Pyrodictium occultum, in addition to Pfu polymerase C used in protein isolation. 
[0045] In addition, there are also elucidated that each of F1, F2, F3, F4 and F5 mentioned above enhances an 
activity of Pfu polymerase C and Pfu DNA polymerase. 

[0046] When the amino acid sequence of the protein derived from the above strain Pyrococcus furiosus DSM 3638 
45 is compared with an amino add sequence of a known protein, F1 has homologies to a single-stranded DNA-specific 
exonuclease derived from Haemophilis influenzae [Science, 269, 496-512 (1995)]. F3 has homologies to Mycoplana 
rarnosa-derived acetylpolyamine aminohydrase [Journal of Bacteriology, 178, 5781-5786 (1996)] and human histone 
deacetylase [Science, 272, 408-411 (1996)]. In addition, F7 has homologies to the proliferating cell nuclear antigen 
(PCNA) involved in the DNA replication in eukaryotes [EMBOJ., 11, 5111-5120 (1995); Nucleic Acids Research, 18, 
so 1363-1381 (1990); Proc. Natl. Acad. Sci. USA, 84, 1575-1579 (1987)]. F2, F4 and F5 have not been found to have 
homologies to a known protein. 

[0047] There has been reported that PCNA forms a complex with a replication factor C (RFC, RF-C) to be involved 
in DNA synthesis [Journal of Biochemistry, 68, 1542-1548 (1996)]. Therefore, even in Pyrococcus furiosus, it is 
expected that a protein corresponding to RFC is expressed, and that the protein is involved in DNA synthesis reaction 
55 together with F7 mentioned above. A further excellent effect of enhancing DNA polymerase syrrthesizing-activity can be 
obtained by collecting this protein, and for instance, adding the resulting protein together with F7 mentioned above in 
the reaction system for DNA polymerase. The gene encoding an RFC homolog of Pyrococcus furiosus can be obtained 
by the steps described below. 
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[0048] An entire nucleotide sequence of chromosomal DNA of archaebacteria Methanococcus jannaschii has been 
already elucidated (Science, 273, 1058-1073 (1996)], and the nucleotide sequences carry the gene encoding a protein 
which is considered to be a homolog of PCNA and RFC. The amino acid sequence encoded by the gene of a homolog 
of RFC small subunit and large subunit of the strain is compared with the amino acid sequence encoded by a known 

5 RFC small subunit gene [Nucleic Acids Research, 21, 1-3 (1993); Nucleic Acids Research, 22, 1527-1535 (1994)], 
thereby examining for the amino acid sequences of high homologies. A synthetic oligonucleotide can be prepared in 
reference to the above, the oligonucleotide usable as a primer or probe for obtaining a gene fragment encoding RFC 
small subunit and large subunit Subsequently, by the manipulations employed for the obtainment of the gene encoding 
any one of F1 to F7 mentioned above using the oligonucleotide, there can be obtained, for instance, a gene encoding 

10 PFU-RFC, which is a homolog of RFC small subunit, and a gene encoding PFU-RFCLS. which is a homolog of RFC 
large subunit, each derived from Pyrococcus furiosus. 

[0049] The nucleotide sequence of the gene encoding the PFU-RFC obtained as above is determined, and an 
amino acid sequence deduced to be encoded thereby is examined, and the amino acid sequence is compared with the 
amino acid sequence of a known RFC small subunit As a result, there has been elucidated that an intervening 

15 sequence (intein) is present in the amino acid sequence. 

[0050] A region corresponding to intein is eliminated from the gene, whereby a gene comprising PFU-RFC in an 
expressible state can be obtained. The nucleotide sequence of an open reading frame of a region encoding PFU-RFC 
in the gene and the amino acid sequence of PFU-RFC deduced from the nucleotide sequence are shewn in SEQ ID 
NOs: 4 and 3, respectively, in Sequence Listing. In addition, the nucleotide sequence of an open reading frame encod- 

20 ing PFU-RFCLS in the PFU-RFCLS gene and the amino acid sequence of the protein encoded thereby are shown in 
SEQ ID NOs: 63 and 64, respectively, in Sequence Listing. Both of these proteins are also one of concrete examples 
of the DNA polymerase-associated factor of the present invention. 

[0051] Further, a plasmid to be used for expression of PFU-RFC can be constructed by using the gene. Such an 
expression plasmid includes plasmid pRFS254SNc. In addition, a transfbrmant of Escherichia coli JM109, into which 

25 the plasmid is introduced, is named and identified as Escherichia coli JM1094>RFS254SNc, and deposited under the 
accession number of FERM BP-6339 with the National Institute of Bioscience and Human-Technology, Agency of 
Industrial Science and Technology, Ministry of International Trade and Industry, of which the address is 1-3, Higashi 1- 
chome, Tsukuba-shi, Ibaraki-ken (Zipcode 305-8566), Japan, since June 3, 1997 (date of original deposit) under the 
Budapest Treaty. PFU-RFC can be obtained by culturing the transformant, and collecting from the resulting culture. 

30 With regard to PFU-RFC, it is observed that the PFU-RFC enhances an activity of a DNA polymerase when used alone, 
and that the PFU-RFC exhibits synergistic effects in enhancing actions as compared to a case where each protein is 
added alone when used in combination of F7 above. 

[0052] In addition, a transformant resulting from introduction of both PFU-RFC gene and PFU-RFCLS gene is pre- 
pared, whereby a complex formed with PFU-RFC and PFU-RFCLS (hereinafter referred to as "holo-RFC"; in particular, 
35 holo-RFC produced by genetic engineering is referred to as w rRFC-M complex") can be expressed. The complex is 
capable of enhancing an activity of a DNA polymerase, which particularly shows high effects when used in combination 
with F7 mentioned above. 

[0053] The above PFU-RFC and PFU-RFCLS can be further allowed to enhance a DNA polymerase activity by 
using a mixture with F7. In this case, a mixture of the holo-RFC (or rRFC-M complex) with F7 may be used, or a complex 

40 formed by PFU-RFC, PFU-RFCLS and F7 (RFC-N complex) may be used. 

[0054] As explained above, the present invention provides a DNA polymerase-associated factor capable of enhanc- 
ing DNA synthesizing-activity of a DNA polymerase, and a gene encoding the factor. The factor can be produced by 
genetic engineering by utilizing the gene. Further, a gene encoding a protein having an equivalent function with the 
DNA polymerase-associated factor of the present invention can be also obtained by genetic engineering techniques by 

45 utilizing the gene. 

[0055] The DNA polymerase-associated factor of the present invention comprises a known protein involved in the 
DNA synthesis reaction as described above. Examples of such known proteins include ones homologous to proteins 
such as PCNA and RFC derived from eukaryotes. It has been said that these proteins such as PCNA and RFC form a 
complex to be involved in the DNA synthesis reaction with DNA polymerase 8 [Journal of Biochemistry, 68, 1542-1 548 
so (1 996)]. However, the DNA polymerase-associated factor disclosed in the present invention is capable of enhancing an 
activity of a DNA polymerase with not only the complex, but also individual factors atone. Also, the factor exhibits an 
effect on a DNA polymerase which is structurally different from DNA polymerase 5. 

[0056] The present invention can be utilized in various processes utilizing a DNA polymerase, including, for 
instance, nucleotide sequencing for DNA, DNA labeling, DNA amplification by PCR, and the like. The DNA polymerase- 
55 associated factor of the present invention is added to a reaction system for a DNA polymerase, whereby particularly 
showing an improvement in an activity of extension of DNA strand from the primer. In addition, since the factor has a 
high thermostability, H can be utilized for PCR, particularly for PCR in which an amplification of a long chain DNA is 
desirable. 
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[0057] Further, among the DNA polymerase-associated factors of the present invention, ones having an activity of 
binding to a DNA polymerase can be used for detection, purification, and the like, of the DNA polymerase. For instance, 
the factor can efficiently purify the bound DNA polymerase by subjecting it to affinity chromatography using a carrier to 
which the DNA polymerase-associated factor of the present invention is bound. 

5 

3. Method for Producing DNA Polymerase- Associated Factor of the Present Invention 

[0058] One of the features of the method for producing a DNA polymerase-associated factor of the present inven- 
tion resides in that the method comprises culturing a transformarrt harboring the gene of the present invention, and col- 

10 lecting from the cultured medium a thermostable DNA polymerase-associated factor capable of enhancing DNA 
synthesizing-activity of a DNA polymerase, or possessing an activity of binding to a DNA polymerase. 
[0059] In the method for producing a DNA polymerase-associated factor of the present invention, a generally 
employed method for purification of proteins can be applied. For instance, a DNA encoding the DNA polymerase-asso- 
ciated factor of the present invention is ligated to an expression vector, whereby being overexpressed under the control 

15 of a promoter of the expression vector. In addition, the DNA polymerase-associated factor of the present invention can 
be easily collected from a transformarrt harboring the gene of the present invention by a process comprising ligating a 
DNA encoding the DNA polymerase-associated factor of the present invention to a DNA encoding a protein such as glu- 
tathione reductase and p-galactosidase or to a DNA encoding histidine tag, to be expressed as a fusion protein. The 
fusion protein mentioned above can be easily isolated by using usually employed affinity column chromatography, such 

20 as nickel column. In the fusion protein mentioned above, the DNA polymerase-associated factor can be separated from 
a protein such as glutathione reductase or p-galactosidase by a conventional method. 

[0060] In addition, the expressed DNA polymerase-associated factor of the present invention can be obtained in the 
same manner as the method for obtaining the DNA polymerase-associated factor of the present invention from Pyo- 
coccus furiosus, the method comprising immobilizing a DNA polymerase, such as Pfu polymerase C, to an appropriate 
& carrier, mixing the DNA polymerase-immobilized carrier with a sample containing the DNA polymerase-associated fac- 
tor, removing ones unbound to the carrier, and eluting one bound thereto. 

4. Method of DNA Synthesis 

30 [0061 ] One of the great features of the method of DNA synthesis of the present invention resides in that a DNA is 
synthesized using a DNA polymerase in the presence of the DNA polymerase-associated factor of the present invention 
mentioned above. In the method of DNA synthesis of the present invention, a DNA is synthesized using a DNA polymer- 
ase in the presence of the DNA polymerase-associated factor of the present invention, whereby a long chain DNA of 
about 20 kb can be amplified. 

35 [0062] The DNA polymerase-associated factor usable in the method of DNA synthesis of the present invention 
includes F1, F2, F3, F4, F5, F7, PFU-RFC, PFU-RFCLS and the like. In the method of DNA synthesis of the present 
invention, the DNA polymerase-associated factor may be used alone or in admixture of two or more kinds. In the 
method of DNA synthesis of the present invention, an even longer DNA fragment can be synthesized as compared with 
the length of the DNA fragment obtained in the conventional method of DNA synthesis by, for instance, using three 

40 kinds of the DNA polymerase-associated factors F7, PFU-RFC and PFU-RFCLS. In the method of DNA synthesis of 
the present invention, the three kinds of the DNA polymerase-associated factors may be used by mixing the three kinds 
each supplied singly, or they may be used in admixture two kinds of F7 and holo-RFC constituted by PFU-RFC and 
PFU-RFCLS (rRFC-M complex). Further, the three kinds of the DNA polymerase-associated factors may be used as a 
complex constituted by F7, PFU-RFC and PFU-RFCLS (RFC-N complex). 

45 [0063] The DNA polymerase used in the method of DNA synthesis of the present invention includes DNA polymer- 
ases such as pol I derived from E. coir, and thermostable DNA polymerases such as Tth DNA polymerase derived from 
Thermus thermophilus, Taq DNA polymerase derived from Thermus aquaticus, and Pfu DNA polymerase derived from 
Pyrococcus furiosus. 

[0064] In addition, in the method of DNA synthesis of the present invention, a DNA can be synthesized by PCR 
so method using the DNA polymerase mentioned above. 

[0065] In the method of DNA synthesis of the present invention, the amount of the DNA polymerase-associated fac- 
tor of the present invention to be present is not particularly limited, and an amount sufficient for exhibiting an activity of 
enhancing synthesizing-activity of the DNA polymerase may be used. 

55 5. Kit Comprising DNA Polymerase-Associated Factor of the Present Invention 

[0066] The DNA polymerase-associated factor of the present invention can be utilized in various reactions in which 
a DNA polymerase is used. Therefore, the DNA polymerase-associated factor of the present invention is attached to a 



10 



EP 0 997 530 A1 



kit usable for in vitro DNA synthesis, including, for instance, a kit for nucleotide sequencing of DNA by the dideoxy 
method, a kit for DNA labeling, a PCR kit, whereby improving the performance of each of these kits. Besides ones con- 
taining the DNA polymerase and the DNA poiymerase-associated factor of the present invention, the kit as described 
above may comprise a reagent required for the reaction of a DNA polymerase, the reagent including, for instance, dNTP 

5 and MgCI 2 . The DNA poiymerase-associated factor contained in the kit of the present invention includes F1 , F2, F3, F4, 
F5, F7, PFU-RFC and PFU-RFCLS. In the kit of the present invention, the DNA poiymerase-associated facta maybe 
used alone or in admixture of two or more kinds. It is preferable to use three kinds of the DNA poiymerase-associated 
factors F7, PFU-RFC and PFU-RFCLS. Each of the three kinds of the DNA poiymerase-associated factors may be used 
by mixing each of the three kinds supplied singly. Also, there may be used in admixture of two kinds F7 and holo-RFC 

10 constituted by PFU-RFC and PFU-RFCLS (rRFC-M complex). Further, the three kinds of the DNA poiymerase-associ- 
ated factors may be used as a complex constituted by F7, PFU-RFC and PFU-RFCLS (RFC-N complex). The DNA 
polymerase contained in the kit of the present invention also includes DNA polymerases such as pol I derived from E. 
co//; and thermostable DNA polymerases such as Tth DNA polymerase derived from Thermus thermophilus. Tag DNA 
polymerase derived from Thermus aquaticus, Pfu DNA polymerase derived from Pyrococcus furiosus. In the kit of the 

is present invention, it is preferable that the kit comprises a thermostable DNA polymerase. The kit of the present inven- 
tion is used for the method of DNA synthesis, whereby a high molecular DNA can be synthesized more simply. 

EXAMPLES 

20 [0067] The present invention is hereinafter described by means of the following examples, but the scope of the 
present invention is not limited only to those examples. 

Example 1 

25 (1) Preparation of Pyrococcus furiosus Genomic DNA 

[0068] Pyrococcus furiosus DSM3638 was cultured in the following manner. 

[0069] A medium having a composition comprising 1% trypton, 0.5% yeast extract, 1% soluble starch, 3.5% Jama- 
rin S Solid (manufactured by Jamarin Laboratory), 0.5% Jamarin S Liquid (manufactured by Jamarin Laboratory), 

30 0.003% MgS0 4 , 0.001% Nad, 0.0001% FeS0 4 • 7H2O, 0.0001% C0SO4, 0.0001% CaCfe * ?H 2 0, 0.0001% ZnS0 4 , 
0.1 ppm CuSCVSH^O, 0.1 ppm KAI(S0 4 ) 2 , 0.1 ppm H3BO3, 0.1 ppm Na 2 Mo0 4 • 2H2O, and 0.25 ppm NiCI 2 *6H 2 0 
was placed in a two-liter medium bottle and sterilized at 120°C for 20 minutes. After sparging with nitrogen gas thereinto 
for removal of dissolved oxygen, the above strain was inoculated into the resulting medium. Thereafter, the medium was 
cultured by allowing to stand at 95°C for 16 hours. After termination of the cultivation, cells were harvested by centrifu- 

35 gation. 

[0070] The harvested cells were then suspended in 4 ml of 0.05 M Tris-HCI (pH 8.0) containing 25% sucrose. To 
this suspension, 0.8 ml of lysozyme [5 mg/ml, 0.25 M Tris-HCI (pH 8.0)] and 2 ml of 0.2 M EDTA were added, and the 
resulting mixture was incubated at 20°C for 1 hour. Thereafter, 24 ml of an SET solution [150 mM NaCI, 1 mM EDTA, 
and 20 mM Tris-HCI (pH 8.0)] was added thereto, and 4 ml of 5% SDS and 400 nl of proteinase K (10 mgArtl) were fur- 
40 ther added to the resulting mixture. Thereafter, the resulting mixture was reacted at 37°C for 1 hour. After termination 
of the reaction, phenol-chloroform extraction and subsequent ethanol precipitation were carried out to prepare about 
3.2 mg of genomic DNA. 

(2) Preparation of Cosmid DNA Library 

45 

[0071] Four hundred micrograms of the genomic DNA from Pyrococcus furiosus DSM3638 was partially digested 
with Sau3A1 and fractionated by size into 35 to 50 kb fractions by density gradient ultracentrrfugation method. Next, 1 
ng of triple helix cosmid vector (manufactured by Stratagene) was digested with Xba\, and thereafter dephosphorylated 
using an alkaline phosphatase (manufactured by Takara Shuzo Co., Ltd.), and further digested with BamHI. The result- 

50 ing treated vector was mixed with 140 ^g of the above 35 to 50 kb DNA fractions, and the mixture was subjected to liga- 
tion reaction. The cosmid carrying the genomic DNA fragment from Pyrococcus furiosus was packaged into lambda 
phage particles by in vitro packaging method using the resulting reaction mixture and "GIGAPACK GOLD" (manufac- 
tured by Stratagene), to prepare cosmid library. Subsequently, a portion of this library was transduced into E co// 
DHSaMCR (manufactured by BRL). Five hundred clones were selected from the resulting transformants, each named 

55 as Cosmid Clone Na 1 to No. 500. Further, a cosmid DNA was prepared from each of these clones. Several of them 
out of the resulting cosmid DNAs were selected and digested with a restriction enzyme to confirm the presence of an 
insert of an appropriate size. 
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(3) Cloning of Pfu Polymerase C Gene 

[0072] There was prepared as a reaction solution 20 mM Tris-HCI (pH 7.7), 2 mM MgCI 2 , 2 mM 2-mercaptoethanol, 
0.2 mg/ml activated DNA, 40 jiM each of dATP, dCTP, dGTP and dTTP, 60 nM [ 3 H)-dTTP (manufactured by Amersham). 

s To 45 *il of the reaction solution was added a 1 \A extract in 5 clone equivalent (5 jil) derived from each clone of the 
above cosmid DNA library, and the mixture was reacted at 75°C for 15 minutes. Thereafter, a 40 ^ aliquot of this reac- 
tion mixture was then spotted onto DE paper and washed with 5% Na 2 HP0 4 five times. The remaining radioactivity on 
the DE paper was determined using a liquid scintillation counter. Primary determination was carried out with one group 
consisting of 5 clones. The group found to have some activities was subsequently separated into one clone each from 

10 the 5 clones, and secondary determination was then carried out. Since it had been already known from a hybridization 
test with the gene as a probe that those clones in the cosmid DNA library containing a known DNA polymerase gene 
were Clone Nos. 57, 154, 162 and 363, there were obtained five clones of Clone Nos. 41, 153, 264, 462 and 491 pos- 
sessing DNA synthesizing-activity other than those clones. 

[0073] Cosmids were isolated from the above five clones, and each isolated cosmid was digested with BamHl 
is When examining the resulting electrophoretic patterns, there were found several mutually common bands, predicting 
that those five clones recombine regions with overlaps and slight shifts. With this finding in mind, the restriction endo- 
nudease map was prepared for the DNA inserts in Clone Nos. 264 and 491. On the basis of the resulting restriction 
endonuclease map, various DNA fragments of 10 kbp or so in length were cut out from the cosmid derived from Clone 
264 or 491 . The fragments were then subcloned into pTV1 1 8N or pTV1 19N vector (manufactured by Takara Shuzo Co., 
20 Ltd.). The thermostable DNA polymerase activity was measured for the resulting transformant harboring the recom- 
binant plasmid obtained. As a result, it was found that a gene for producing a highly thermostable DNA polymerase was 
present on an Xba\-Xba\ fragment of about 10 kbp. A plasmid resulting from incorporation of the Xbal-Xbal fragment 
into pTV1 1 8N vector was then named as plasmid pFU1 001 , and the Escherichia coli JM109 transformed with the plas- 
mid was named as Escherichia coli JM109/&FU1001 (FERM BP-5579). 

25 

(4) Analysis of DNA Polymerase-Constituting Protein of Pfu Polymerase C 

[0074] The above Xba\-Xba\ fragment containing the DNA polymerase gene, was again cut out from the above 
plasmid pFU1001 with Xba\, and blunt-ended using DNA blunting kit (manufactured by Takara Shuzo Co., Ltd.). The 

30 resultant was then ligated to new pTV1 1 8N vector, previously linearized with Smal, to yield plasmids for preparing dele- 
tion mutants. The resulting plasmids were named as pFU1002 and pFU1003, respectively, in accordance with the ori- 
entations of the inserts. Deletion mutants were prepared from sequentially deleting from both ends of the DNA insert 
using these plasmids. Kilo-Sequence Deletion kit (manufactured by Takara Shuzo Co., Ltd.) applying Henikoffs method 
(Gene, 28, 351-359) was used for the above preparation. The 3'-overhanging and 5*-overhanging restriction enzymes 

35 used were Psfl and Xbal respectively. The nucleotide sequence of the insert was determined by the dideaxy method 
using BcaBESTdideoxy sequencing kit (manufactured by Takara Shuzo Co., Ltd.) with the various deletion mutants as 
templates. The resulting nucleotide sequence was analyzed, and as a result, there were found six open reading frames 
(ORFs). The thermostable DNA polymerase activity was determined using the above various deletion mutants. The 
results demonstrated that the translation products of the ORF3 and the ORF4 were important in the exhibition of the 

40 DNA polymerase activity. The amino acid sequence of the ORF3 is shown in SEQ ID NO: 5 in Sequence Listing, and 
the amino acid sequence is shown in SEQ ID NO: 6 in Sequence Listing, respectively. In other words, the Pfu polymer- 
ase C is an enzyme comprising two kinds of the DNA polymerase-constituting proteins having amino acid sequences 
as shown in SEQ ID NO: 5 and SEQ ID NO: 6 in Sequence Listing, respectively 

45 Example 2 

(1) Preparation of Pfu Polymerase C 

[0075] Pfu polymerase C used as an antigen was prepared in the following manner. Escherichia coli 
so JM109/pFU1001 was cultured in 2 liter of LB medium (1 .0% trypton, 0.5% yeast extract, 0.5% NaCI, pH 7.2) containing 
100 ng/ml ampicillin. When the turbidity of the culture reached 0.6 in Aeoo. an inducer, isopropyl-p-D-thiogalactoside 
(IPTG) was added so as to have a final concentration of 1 mM, and cultured for additional 16 hours. After harvesting, 
the harvested cells were suspended in 37 ml of sonication buffer [50 mM Tris-HCI, pH 8.0, 2 mM 2-mercaptoethanol, 
10% glycerol. 2 mM PMSF (phenyl methanesulfony I fluoride)], and the suspension was treated with an ultrasonic dis- 
ss rupter. The supernatant resulting from centrifugation of the disrupted solution at 12,000 rpm for 10 minutes was heat- 
treated at 80°C for 15 minutes. Thereafter, centrifugation was again carried out at 12,000 rpm for 10 minutes and the 
supernatant was recovered, to yield 33 ml of a heat-treated supernatant. Subsequently, the above solution was sub- 
jected to 2-hour dialysis for 4 times with 2 liter of buffer A [50 mM potassium phosphate, pH 6.5, 2 mM 2-mercaptoeth- 
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anol, 10% glycerol] as a dlalysate. After dialysis, 32 ml of the enzyme solution was applied to RESOURCE Q column 
(manufactured by Pharmacia) which was previously equilibrated with buffer A, and the applied solution was chromato- 
graphed using FPLC system (manufactured by Pharmacia). The elution was carried out on a linear concentration gra- 
dient from 0 to 500 mM NaCI. A fraction having a DNA polymerase activity was eluted at 340 mM NaCI. 

5 [0076] Ten milliliters of an enzyme solution obtained by collecting an active fraction was concentrated by using Cen- 
triflow CF-50 (manufactured by Grace Japan), and the concentrated enzyme solution was then subjected to exchange 
with buffer A containing 150 mM NaCI with PD-10 column (manufactured by Pharmacia) to yield 3.5 ml of an enzyme 
solution. The resulting enzyme solution was then applied to HiTrap Heparin column (manufactured by Pharmacia), pre- 
viously equilibrated with the same buffer. An active fraction eluted at a concentration of 400 mM NaCI was obtained by 

to eluting with a linear concentration gradient from 150 to 650 mM NaCI using FPLC system. Five milliliters of this fraction 
was concentrated by ultrafiltration using Centricon-10 (manufactured by Amicon), and 120 pJ of the resulting concen- 
trate was applied to Superose 6 gel filtration column (manufactured by Pharmacia), previously equilibrated with 50 mM 
potassium phosphate buffer (pH 6.5) containing 75 mM NaCI and 2 mM 2-mercaptoethanol, and the elution was carried 
out with the same buffer. As a result, a fraction having a DNA polymerase activity was eluted at positions corresponding 

is to retention times of 34.7 minutes and 38.3 minutes. The fraction eluted at the position of 38.3 minutes was concen- 
trated, and the resulting concentrate was used as an antigen in the preparation of an arrti-Pfu polymerase C polyclonal 
antibody. 

[0077] Incidentally, in the purification of the above Pf u polymerase C. the enzyme activity was determined in the fol- 
lowing manner. An activated calf thymus DNA (manufactured by Worthington) (activated DNA) was used as a substrate. 

20 Determinations of DNA activation and DNA polymerase activity were carried out by the method described in DNA 
Polymerase from Escherichia co//, 263-276 (authored by C.C. Richardson), published by Harper & Row, edited by D.R. 
Davis. To 5 nl of a sample of which the activity was to be determined was added 45 ^1 of a reaction solution [20 mM 
Tris-HCI (pH 7.7), 1 5 mM MgCl2, 2 mM 2-mercaptoethanol, 0.2 mg/ml activated DNA, 40 nM each of dATP, dCTP, dQTP 
and dTTP, 60 nM ^HJ-dTTP (manufactured by Amersham)]. The resulting mixture was reacted at 75°C for 5 minutes. 

25 A 40 pi portion of this reaction mixture was then spotted onto DE paper (manufactured by Whatman) and washed with 
5% Na 2 HP0 4 five times. The remaining radioactivity on the DE paper was determined using a liquid scintillation coun- 
ter. The amount of enzyme which incorporated 10 nmol of [ 3 H]-dTMP per 30 minutes into the substrate DNA, deter- 
mined by the above-described enzyme activity determination method, was defined as one unit of the enzyme. 

30 (2) Preparation of Anti-Pfu Polymerase C Antibody 

[0078] The above Pfu polymerase C preparation was diluted with 50 mM potassium phosphate, pH 6.5, 2 mM 2- 
mercaptoethanol, and 75 mM NaCI so as to have a concentration of 1 mg/100 pi. Thereto was added an equal volume 
of complete Round's adjuvant, and the mixture was emulsified. The resulting emulsion was subcutaneously injected at 

35 50 (il per injection to rabbit 4 times in 3-week intervals. Whole blood was extracted 1 0 days after the final immunization, 
and the extracted blood was allowed to stand at room temperature for 60 minutes. Thereafter, the Wood was centrtf uged 
to yield 60 ml of antisera containing anti-Pfu polymerase C polyclonal antibody. To 20 ml of the antisera was added 20 
ml of saturated ammonium sulfate solution. The mixture was gently stirred at 4°C for 45 minutes, and centrifuged. The 
resulting precipitate was suspended in 5 ml of 20 mM sodium phosphate buffer, pH 7.0, and the suspension was sub- 

40 jected to a 2-hour dialysis for 3 times using 2 liters of the same buffer as a dialysate. After dialysis, 14 ml of the solution 
was applied to protein A column (manufactured by Pharmacia), previously equilibrated with 20 mM sodium phosphate 
buffer (pH 7.0), washed with the same buffer, and then eluted with 0.1 M sodium citrate buffer (pH 3.0). The eluted anti- 
Pfu polymerase C polyclonal antibody was neutralized with 1 M Tris-HCI, pH 9.0, and concentrated with Centriflow CF- 
50, and subjected to exchange with coupling buffer (0.5 M NaCI, 0.2 M NaHC0 3 , pH 8.3) with PD-10 column (manufac- 

45 tured by Pharmacia), to prepare a solution containing anti-Pfu polymerase C polyclonal antfoody. 

(3) Preparation of Anti-Pfu Polymerase C Antibody Column 

[0079] HiTrap NHS-activated column (manufactured by Pharmacia) was washed with 6 ml of 1 mM HCI. and 0.9 ml 
so of the above anti-Pfu polymerase C polyclonal antibody solution (containing 3.6 mg equivalent of the anti-Pfu polymer- 
ase C polyclonal antibody) was then applied to HiTrap NHS-activated column. After allowing to stand at room temper- 
ature for 1 hour, the resulting column was washed with 3 ml of the coupling buffer. Subsequently, the column was 
sequentially washed with 6 ml of blocking buffer (0.5 M Tris-HCI, pH 8.3, 0.5 M NaCI). 6 ml of buffer B (0.1 M sodium 
acetate, pH 4.0, 0.5 M NaCI), and 6 ml of the blocking buffer, and the resulting mixture was allowed to stand at room 
55 temperature for 30 minutes. Further, the column was washed with 6 ml of buffer B, 6 ml of the blocking buffer, and 6 ml 
of buffer B, and thereafter the column was equilibrated with 50 mM Tris-HCI, pH 8.0, to prepare an anti-Pfu polymerase 
C antibody column. 
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Example 3 

(1) Purification of Complex Comprising Pfu Polymerase C Using Anti-Pfu Polymerase C Antibody Column 

5 [0080] Pyroooccus furiosus DSM3638 was cultured in two medium bottles for 1 6 hours in the same manner as the 
method described in Example 1 . After harvesting, cells were suspended in 34.7 ml of buffer C (50 mM Tris-HCI, pH 8.0, 
1 mM ATP) containing 2 mM PMSF, and the suspension was treated with an ultrasonic disrupter. The disrupted solution 
was centrifuged at 12,000 rpm for 10 minutes, and 46 ml of the supernatant obtained was applied to an anti-Pfu 
polymerase C antibody column, previously equilibrated with buffer C. After the column was washed with buffer C, the 

10 complex comprising Pfu polymerase C was eluted with elution buffer (0. 1 M glycine-HCI, pH 2.5, 1 mM ATP). After neu- 
tralization with 1 M Tris-HCI, pH 9.0, the eluate was concentrated using Centrif low CF-50 to yield a Pfu polymerase C 
complex concentrate. 

(2) Analysis of Pfu Polymerase C Complex 

75 

[0081] The Pfu polymerase C complex concentrate was subjected to SDS-PAGE (12.5% polyacrylamide gel; 25 
mM Tris-HCI, 192 mM glycine, 0.1% SDS, pH 8.4 being used as electrophoresis buffer). The gel obtained was analyzed 
by Western blotting using the anti-Pfu polymerase C antibody by the method shown below. After SDS-PAGE, the gel 
was immersed in blotting buffer 1 (25 mM Tris-HCI, 20% methanol, pH 9.4) containing 40 mM e-amino-n-caproic acid. 

20 Next, filter papers immersed in blotting buffer 2 (0.3 M Tris-HCI, 20% methanol, pH 10.4), filter papers immersed in 25 
mM Tris-HCI and 20% methanol, pH 10.4, a PVDF membrane immersed in blotting buffer 1 containing 40 mM e-amino- 
n-caproic acid, the above gel, and filter papers immersed in blotting buffer 1 containing 40 mM e-amino-n-caproic acid 
were overlaid on semi-dry blotting apparatus (manufactured by Scientific), and blotting was carried out at 2 mA/cm 2 for 
1 hour. This PVDF membrane was immersed in Block Ace (manufactured by Snow Brand Milk Products Co., Ltd.) con- 

25 taining 0.01% thimerosal, shaken for 10 minutes, and thereafter the membrane was immersed in an anti-Pfu polymer- 
ase C antiserum, previously diluted 1,000 fold with Block Ace containing 0.01% thimerosal. After allowing to stand at 
room temperature for 1 hour, the membrane was washed thrice for 10 minutes with TBS buffer (50 mM Tris-HCI, pH 7.5, 
150 mM NaCi) containing 0.02% Tween-20 and further washed with TBS buffer. The membrane was then immersed in 
a peroxidase-labeled anti-rabbit IgG (Fc) antibody (manufactured by Organon-Technica), previously diluted 5,000 fold 

30 with Block Ace containing 0.01% thimerosal. After allowing to stand at room temperature for 1 hour, the PVDF mem- 
brane was washed thrice for 10 minutes with TBS buffer containing 0.02% Tween-20 and further washed with TBS 
buffer. Thereafter, the membrane was immersed in Konica Immunostain HRP-1000 (manufactured by Konica Corpora- 
tion) to allow color development. From the results of staining of the gel after SDS-PAGE with Coomassie Brilliant Blue 
R-250, shown in Figure 1, and the results of the Western blotting mentioned above, it was elucidated that the above 

35 complex fraction contained seven kinds of proteins (F1 to F7 in Figure 1 ) unreactive with the anti-Pfu polymerase C anti- 
body. 

[0082] Since the bands unreactive with the anti-Pfu polymerase C antibody are considered to be proteins adsorbed 
to the column via Pfu polymerase C, N-terminal amino acid sequences of these proteins were analyzed by the method 
described below. The Pfu polymerase C complex concentrate obtained in Example 3(1) was subjected to SDS-PAGE 
40 and blotted onto a PVDF membrane in the same manner as above. After this membrane was stained with Coomassie 
Brilliant Blue R-250. the desired bands were cut out. The N-terminal amino acid sequences of the desired proteins were 
determined by automatic Edman decomposition with G1000A Protein Sequencer (manufactured by Hewlett-Packard 
Company) using these membrane fragments as samples. The results are shown in Table 1 . The N-terminal amino acid 
sequences obtained, F1 to F5 and F7, are shown in SEQ ID NOs: 7 to 12, respectively, in Sequence Listing. 

45 



Table 1 



Sample 


N-Terminal Amino Acid 
Sequence 


F1 


MDKEGFLNKVREAVDWKLH 


F2 


MFTGKVLIPVKVLKKFENWN 


F3 


MIGSIFYSKKFNLHRPSEYH 


F4 


MKDYRPLLGAIKVKGDNVFS 


F5 


MDIEVLRRLLERELSSEH 


F6 


Unable to be analyzed 
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Table 1 (continued) 



Sample 


N-Terminal Amino Acid 
Sequence 


F7 


PFEIVFEGAKEFAQLID 



Example 4 

Preparation of Cassette DNAs 

10 

[0083] Ten micrograms of Pyrococcus fuhosus genomic DNA prepared in Example 1 was completely digested with 
EcoRI (manufactured by Takara Shuzo Co., Ltd.), and 500 ng equivalent of the digest was mixed with 50 ng of EcoRI 
cassette (manufactured by Takara Shuzo Co., Ltd.), followed by ligation. The DNA recovered from the ligation reaction 
mixture for ligation by ethanol precipitation was dissolved in 20 nJ of sterilized water, and this solution was used as 

15 EcoH\ cassette DNA for the subsequent procedures. 

[0084] Using similar procedures as those described above, cassette DNAs ligated with each of HindlW cassette, 
Xbal cassette, Sail cassette, Pstl cassette and Sau3A\ cassette (all manufactured by Takara Shuzo Co., Ltd.) were pre- 
pared. When ligated with the Xba\ cassette, genomic DNA digested with two enzymes, i.e., Xba\ and Nhel, was used, 
and each of the DNAs obtained were named Xba\ cassette DNA and Nhe\IXba\ cassette DNA, respectively. When 

20 ligated with the Sail cassette, genomic DNA digested with the two enzymes Sail and Xho\ was used, and each of the 
DNAs obtained were named Sal\ cassette DNA and Xho\ISal\ cassette DNA, respectively. When ligated with the 
Sa;3AI cassette, genomic DNA digested with Bgfll was used, and the DNA obtained was named BglWISauZM cassette 
DNA. 

25 Example 5 

(1) Selection of Cosmid Clones Carrying F1 Gene 

[0085] On the basis of the N-terminal amino acid sequence of F1 obtained in Example 3, the primers F1 -1 and F1 - 

30 2, of which nucleotide sequences are shown in SEQ ID NOs: 13 and 14, respectively, in Sequence Listing, were syn- 
thesized. First PCR was carried out using 100 pmol each of F1 -1 and the cassette primer C1 {manufactured by Takara 
Shuzo Co., Ltd.) with 1 ^l of the EcoRI cassette DNA prepared in Example 4 as a template. Second PCR was carried 
out using 100 pmol each of F1-2 and the cassette primer C2 (manufactured by Takara Shuzo Co., Ltd.) with 1 ul of the 
resulting reaction mixture obtained as above as a template. For the two PCRs, Pfu DNA polymerase (a-type enzyme, 

35 manufactured by STRATAGENE) was used. The reaction mixture composition and reaction conditions are shown 
below: The reaction mixture comprises 20 mM Tris-HCI, pH 8.2, 10 mM KCI, 20 mM MgC^, 6 mM (NH^SO* 0.2 mM 
each of dATP, dCTP, dGTP, dTTP, 1% Triton X-100, 0.01% BSA and 2.5 units of Pfu DNA polymerase (final volume 
being 100 ul), and the reaction was carried out in 30 cycles for the first PCR and in 25 cycles for the second PCR, 
wherein one cycle comprises a process consisting of at 94°C (30 seconds) - 45°C (30 seconds) - 72°C (2 minutes). The 

40 PCR using Pfu DNA polymerase described in the Examples below was also carried out using the same reaction mixture 
composition. An amplified DNA fragment of about 550 bp was subcloned into plasmid vector pUC1 19 (manufactured 
by Takara Shuzo Co., Ltd.), and its nucleotide sequence was determined. Thereafter, on the basis of the sequence 
determined, the primers F1S1 and F.S2, of which nucleotide sequences are shown in SEQ ID NOs: 15 and 16, respec- 
tively, in Sequence Listing, were then synthesized. PCR was carried out using these F1S1 and F1S2 with the cosmid 

45 DNA mentioned in Example 1 as a template, whereby selecting cosmid clones carrying the F1 gene. This PCR was car- 
ried out using TaKaRa PCR amplification kit (manufactured by Takara Shuzo Co., Ltd.) in accordance with the instruc- 
tions attached. As a result, there were found that cosmid clone Nos. 22, 46, 61, 133, 178, 180, 210 and 317 carry the 
F1 gene. 

so (2) Subdoning of F1 Gene 

[0086] PCR was carried out using 20 pmol each of F1 S1 and the cassette primer C2, or each of F1 S2 and the cas- 
sette primer C2, with 1 uJ of the HindlW cassette DNA prepared in Example 4 as a template. The PCR was carried out 
with the same reaction mixture composition as that used in Example 5(1) using Pfu DNA polymerase as an enzyme in 
55 50 cycles, wherein one cycle comprises a process consisting of at 94°C (30 seconds) - 55°C (30 seconds) - 72°C (3 
minutes). As a result, a DNA fragment of 570 bp was amplified by F1S2 and the cassette primer C2, whereas no DNA 
was amplified by F1S1 and the cassette primer C2. This finding anticipated that the HindlW site is located immediately 
upstream of the initiation codon for the F1 gene and at a distance from the annealing position of FTS1 such that DNA 
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cannot be amplified by Pfu DNA polymerase. With this in mind, CosmkJ Clone No. 61 . randomly selected from the cos- 
mid clones carrying the F1 gene, was digested with HindUl, and DNA fragments of not smaller than 1 .5 Kb were iso- 
lated, and each was subcloned into plasmid vector pTV118N (manufactured by Takara Shuzo Co., Ltd.)- PCR was 
carried out using F1 S1 and F1 S2 as primers with each recombinant plasmid obtained as a template, to examine for the 
5 presence of the F1 gene. As a result, it was found that a Hind\\\ fragment of about 2 kb carries the F1 gene. A plasmid 
in which the F1 gene in this DNA fragment ligated to downstream of the lac promoter of pTV1 18N vector was named 
pFI-4-10. As to the DNA inserts contained in this plasmid, a restriction endonuclease map for A/col, EcoRI, BamH\, 
Pstl Sac\ and A/del was prepared. The results as shown in Figure 2 were obtained. 

w (3) Determination of Nucleotide Sequence of DNA Fragment Carrying F1 Gene 

[0087] There was determined by the dideoxy method the nucleotide sequence of the DNA insert in the plasmid 
pF1-4-10 and each plasmid obtained by cutting out the Nco\-Hind\\\, EcoRI-EcoRI, BamH\-Pst\ t EcoK\-Hind\\\, 
Hind\\\-EcoB\ and Hind\\\-BamH\ fragments from the plasmid, and subcloning each of the resulting fragments into plas- 

is mid vector pTV1 19N (manufactured by Takara Shuzo Co., Ltd.). A sequence of 2,009 bp in the nucleotide sequences 
of the DNA insert in pF1-4-10 determined totally on the basis of these results combined together is as shown in SEQ 
ID NO: 17 in Sequence Listing. As a result of analyzing the nucleotide sequence, there was revealed an open reading 
frame comprising the N-terminal amino acid sequence of F1. The above sequence is shown in SEQ ID NO: 18 in 
Sequence Listing, and the amino acid sequence of the F1 translation product as deduced from the above sequence is 

20 shown in SEQ ID NO: 19 in Sequence Listing, respectively. This amino acid sequence was searched for homology to 
the amino acid sequences of known proteins. As a result, it was found to be homologous to the Haemophilus 
influenzae-6er\ved single-stranded DNA-specific exonuclease [Sc/ence, 269, 496-512 (1995)]. The homology was 
23.2% for the first half and 24.3% for the last half. 

25 (4) Construction of Plasmid for F1 Expression 

[0088] PCR was carried out using the primer F1 Nc, of which nucleotide sequence is shown in SEQ ID NO: 20 in 
Sequence Listing, and the above primer F1 S2 with the plasmid pF1 -4-10 described in Example 5(2) as a template. The 
PCR was carried out in a reaction mixture of the same composition as that used in Example 5(1) using Pfu DNA 

30 polymerase. Using 1 ng of template DNA and 20 pmol each of the two primers, the reaction was carried out in 25 cycles, 
wherein one cycle comprises a process consisting of at 94°C (30 seconds) - 55°C (30 seconds) - 72°C (2 minutes). A 
fragment obtained by digesting an amplified DNA fragment of about 460 base pairs with Nco\ and BglWl (both manu- 
factured by Takara Shuzo Co., Ltd.) and a DNA fragment obtained by digesting the above plasmid pFI-4-10 with BglW 
and HindlW were together inserted between the Nco\ and Hind\\\ sites of plasmid vector pTV118N (manufactured by 

35 Takara Shuzo Co., Ltd.). This plasmid was named pF1Nc-2. Of the DNA insert in the plasmid, in the PCR-amplified 
region, the nucleotide sequence was confirmed by the dideoxy method that there is no mutation caused by PCR. 

(5) Preparation of Purified F1 Authentic Sample 

40 [0089] Escherichia coli JM1 09/pF1 Nc2, Escherichia coli JM1 09 transformed with the plasmid pF1 Nc-2 obtained in 
Example 5(4), was cultured for 16 hours in 2 liters of LB medium containing 100 jig/ml ampicillin. After harvesting the 
cells, 33 ml of a heat-treated supernatant was obtained in the same manner as Example 2(1). Next, this solution was 
applied to RESOURCE Q column (manufactured by Pharmacia), previously equilibrated with buffer D (50 mM Tris-HCi, 
pH 8.0, 2 mM 2-mercaptoethand, 10% glycerol), and the applied solution was chromatographed using FPLC system 

45 (manufactured by Pharmacia). The elution was carried out on a linear concentration gradient of 0 to 500 mM NaCI. F1 
was eluted at 340 mM NaCI. 

[0090] After 10 ml of the enzyme solution obtained by collecting the F1 fraction was concentrated using Cerrtrrflow 
CF50, the resulting concentrate was subjected to exchange with buffer D using PD-10 column (manufactured by Phar- 
macia), and 3.5 ml of the solution was applied to HiTrap Blue column (manufactured by Pharmacia), previously equili- 

so brated with the same buffer. Using FPLC system, the column was washed with buffer D, and thereafter F1 was eluted 
with buffer D containing 2 M NaCI. Five milliliters of this fraction was concentrated using Centricon-10, and 120 nl of the 
concentrate was applied to Superdex 200 gel filtration column (manufactured by Pharmacia), previously equilibrated 
with 50 mM Tris-HCI, pH 8.0, containing 2 mM 2-mercaptoethanol and 75 mM NaCI. The elution was carried out with 
the same buffer, and as a result, F1 was eluted at a position corresponding to a molecular weight of about 49 Wlodal- 

55 tons. This molecular weight corresponds to the case where F1 is present as a monomer. 
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(6) Determination of Exonuciease Activity 

[0091] The 5* -> 3' and 3* 5' exonuciease activities of the purified F1 authentic sample were examined in the fol- 
lowing manner. 

5 [0092] First, plasmid vector pUC1 19 (manufactured by Takara Shuzo Co., Ltd.) was digested with Ssp\ (manufac- 
tured by Takara Shuzo Co., Ltd.) and subjected to agarose gel electrophoresis, and a DNA fragment of 386 bp was 
recovered from the gel and purified. This DNA fragment was labeled at the 5'-terminus using [y- 32 P]-ATP (manufactured 
by Amersham) and polynucleotide kinase (manufactured by Takara Shuzo Co., Ltd.), and the ^PJabeled DNA frag- 
ment obtained was used as a substrate for detecting the 5* -» 3' exonuciease activity. In addition, plasmid vector 

10 pUC1 19 was digested with Sau3AI (manufactured by Takara Shuzo Co., Ltd.), and a DNA fragment of 341 bp obtained 
was recovered and purified in the same manner as above. Furthermore, this DNA fragment was ^P-labeled at the 3'- 
terminus by the fill-in reaction using [a- 32 P]-dCTP (manufactured by Amersham) and Klenow fragment (manufactured 
by Takara Shuzo Co., Ltd.) to yield a substrate for detecting the 3' -» 5' exonuciease activity. The above two kinds of 
labeled DNAs were purified by gel filtration through NICK column (manufactured by Pharmacia) and used for the reac- 

75 tion described below. 

[0093] Ten microliters of a reaction mixture (20 mM Tris-HCI, pH 7.7, 1 5 mM MgCI 2 , 2 mM 2-mercaptoethanol) con- 
taining 2 ng of each of these labeled DNA fragments and 12.5 ug of digest obtained by completely digesting A.-DNA 
(manufactured by Takara Shuzo Co., Ltd.) with Haelll (manufactured by Takara Shuzo Co., Ltd.), and the above purified 
F1 authentic sample was prepared and reacted at 85°C for 2.5, 5 or 7.5 minutes, and thereafter ethanol precipitation 
20 was carried out to precipitate the DNA. By determining the radioactivity in this supernatant using a liquid scintillation 
counter, the amount of substrate decomposed by exonuciease activity was determined. In the determination of the 5' 
-> 3' exonuciease activity, 50 fmol of the purified F1 authentic sample was added, and in the determination of the 3' -> 
5' exonuciease activity, 125 pmol of the purified F1 authentic sample was added. These results are shown in Figures 3 
and 4, respectively. 

25 [0094] Figure 3 shows the results for the determination of 5 -► 3' exonuciease activity, and Figure 4 shows the 
results for determination of the 3' -> 5' exonuciease. In the figures, the abscissa indicates reaction time, and the ordi- 
nate indicates the ratio of radioactivity released in the supernatant to that contained in the entire reaction mixture. In 
addition in the figures, solid circles indicate the results obtained with the purified F1 authentic sample of the present 
invention, and open circles indicate a blank reaction without adding the purified F1 authentic sample. As shown in the 

30 figures, the purified F1 authentic sample of the present invention possesses both 5' 3* and 3' -> 5* exonuciease activ- 
ities. Also, from the above results it was demonstrated that the 5' -+ 3' exonuciease activity is about 500 times as great 
as the 3' -» 5* exonuciease activity. 

Example 6 

35 

(1) Selection of Cosmid Clones Carrying F2 Gene 

[0095] On the basis of the N-terminal amino acid sequence of F2 obtained in Example 3, the primers F2-2 and F2- 
3, of which nucleotide sequences are shown in SEQ ID NOs: 21 and 22, respectively, in Sequence Listing, were syn- 

40 thesized. First PGR was carried out using 100 pmol of the primer F2-2 and 20 pmol of the cassette primer C1 with 1 uJ 
of the Xba\ cassette DNA prepared in Example 4 as a template. Second PCR was carried out using 100 pmol of the 
primer F2-3 and 20 pmol of the cassette primer C2 with 1 id of the resulting reaction mixture obtained as above as a 
template. For the two PCRs, Pf u polymerase C was used. The reaction mixture composition and reaction conditions are 
shown below: The reaction mixture comprises 10 mM Tris-HCI, pH 9.2, 75 mM KCI, 3.5 mM MgCI 2 . 0.4 mM each of 

45 dATP, dCTP, dGTP and dTTP, 0.1% Triton X-100, 0.01% BSA and 2.0 units of Pfu polymerase C (final volumebeing 100 
uJ), and the reaction was carried out in 30 cycles for the first PCR and 25 cycles for the second PCR, wherein one cycle 
comprises a process consisting of at 94°C (30 seconds) - 45°C (30 seconds) - 72°C (2 minutes). An amplified DNA frag- 
ment of about 250 bp was subcloned into plasmid vector pUC1 1 9, and its DNA sequence was determined. On the basis 
of the sequence determined, the primers F2S3 and F2S4, of which nucleotide sequences are shown in SEQ ID NOs: 

so 23 and 24, respectively, in Sequence Listing, were then synthesized. PCR was carried out using these primers with the 
cosmid DNA prepared in Example 1 as a template, whereby selecting cosmid clones carrying the F2 gene. The PCR 
was carried out in a reaction mixture of the same composition as that used in Example 5(1) using Pfu DNA polymerase 
as an enzyme and 20 pmol each of the primers in 25 cycles, wherein one cycle comprises a process consisting of at 
94°C (30 seconds) - 55°C (30 seconds) - 72°C (2 minutes). As a result, there was found that Cosmid Clone No. 172 

55 carries the F2 gene. 
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(2) Subcloning of T2 Gene 

[0096] PCR was carried out using 20 pmol each of F2S3 and the cassette primer C2 or each of F2S4 and the cas- 
sette primer C2 as primers with 1 pJ of each of the NheUXbal and XhoUSali cassette DNAs of Example 4 as a template. 

5 The PCR was carried out in a reaction mixture of the same composition as that used in Example 5(1) using Pfu DNA 
polymerase as an enzyme in 50 cycles, wherein one cycle comprises a process consisting of at 94°C (30 seconds) ■ 
55°C (30 seconds) - 72°C (3 minutes). As a result, each of amplified DNA fragments of about 700 bp and of about 1 ,400 
bp for the Nhe\IXba\ and XhoUSali cassette DNAs, respectively, was amplified by the primer pair of F2S3 and the cas- 
sette primer C2, whereas no DNA was amplified by the primer pair of F2S4 and the cassette primer C2. This finding 

w anticipated that the Nhe\ and Xho\ sites are located at a distance from the annealing position of the F2S4 primer unam- 
plrf iable with Pfu DNA polymerase. 

[0097] With this in mind, the various DNA fragments obtained by digesting No. 172 with Nhe\ were cut out, and 
each was suboloned into plasmid vector pTV1 18N (manufactured by Takara Shuzo Co., Ltd.). PCR was carried out 
using F2S3 and F2S4 as primers with each recombinant plasmid obtained as a template, to examine whether or not the 

is F2 gene is present. As a result, it was found that an Nhe\ fragment of about 8 kb carries the F2 gene. A plasmid result- 
ing from insertion of this Nhe\ fragment into pTV1 18N was named plasmid pF2172Nh. In addition, a restriction endo- 
nuclease map was prepared for the DNA insert in this plasmid. The results as shown in Figure 5 were obtained. 
[0098] On the basis of the restriction endonuclease map shown in Figure 5, the plasmid pF2172Nh was digested 
with HindUl and a Hind\\\ fragment of about 1 .5 Kb was cut out and each was subdoned into plasmid vector pTVl 1 8N. 

20 The recombinant plasmid obtained was examined for the insert orientation of the F2 gene, and there was found that the 
F2 gene was inserted in the reverse orientation with respect to the lac promoters of all of the vectors. This plasmid was 
named pF2172H16. Escherichia coli JM109^)F2172H16, Escherichia coli JM109 transformed with this plasmid, was 
examined for F2 expression, and found not to be highly expressed. With this in mind, in order to tigate the F2 gene in 
the orthodox orientation for the vector, pF2172H16 was digested with Hind\\\ and EcoRI, and the H/ncflll-EcoRI frag- 

25 ment cut out was ligated to plasmid vector pTV1 19Nd (those resulting from substitution of the Nco\ site with Nde\ in 
plasmid vector pTV119N manufactured by Takara Shuzo Co.. Ltd.). The recombinant plasmid obtained was named 
pF2172HE11, and Escherichia coli JM109 transformed with this plasmid was named Escherichia coli 
JM109/pF2172HE11. 

30 (3) Preparation of F2 Authentic Sample 

[0099] Escherichia coli JM109/|dF2172HE1 1 obtained in Example 6(2) was cultured for 16 hours in 2 liters of LB 
medium containing 1 mM IPTG and 100 >ig/ml ampicillin. After harvesting, cells were suspended in 23.4 ml of sonica- 
tion buffer, and 19.5 ml of a heat-treated supernatant was obtained in the same manner as Example 2(1). Next, this 
35 solution was applied to RESOURCE Q column, previously equilibrated with buffer D, and the applied solution was chro- 
matographed using FPLC system. F2 flowed through RESOURCE Q column. 

[0100] Twenty-two milliliters of the flow-through F2 fraction was applied to RESOURCE S column (manufactured by 
Pharmacia), previously equilibrated with buffer D. Using FPLC system, the elution was carried out on a linear concen- 
tration gradient of 0 to 500 mM NaCI, and an F2 fraction was eluted at 170 mM Nad. This fraction was concentrated 
40 using Centricon-10. and 75 ni of the concentrate obtained was applied to Superdex 200 gel filtration column, previously 
equilibrated with 50 mM Tris-HCI buffer (pH 8.0) containing 2 mM 2-mercaptoethanol and 75 mM NaCI. The elution was 
earned out with the same buffer, and as a result, F2 was eluted at a position corresponding to a molecular weight of 
about 120 kilodaltons or about 45 kilodaltons. This molecular weight corresponds to the case where F2 has formed a 
hexamer or dimer. 

45 

(4) Determination of Nucleotide Sequence of DNA Fragment Carrying F2 Gene 

[0101] The nucleotide sequence of the DNA insert in the above plasmid pF2172HE1 1 was determined by the dide- 
oxy method. A sequence of 957 bp of the nucleotide sequence determined is shown in SEQ ID NO: 25 in Sequence 
so Listing. As a result of analyzing the nucleotide sequence, there was found an open reading frame having the N-terminal 
amino acid sequence of F2. The nucleotide sequence of this open reading frame is shown in SEQ ID NO: 26 in 
Sequence Listing, and the amino acid sequence of the F2 translation product as deduced from the nucleotide sequence 
is shown in SEQ ID NO: 27 in Sequence Listing, respectively. This amino acid sequence was searched for homology to 
the amino acid sequences of known proteins, and as a result, the homologous proteins were not found. 

55 
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Example i 

(1) Selection of Cosmid Clones Carrying F4 Gene 

5 [0102] On the basis of the N-terminal amino acid sequence of F4 obtained in Example 3, the primers F4-1 and F4- 
2, of which nucleotide sequences are shown in SEQ ID NOs: 28 and 29. respectively, in Sequence Listing, were syn- 
thesized. First PCR was carried out using 100 pmol of the primer F4-1 and 20 pmol of the cassette primer C1 with 1 |il 
of the Hind\\\ cassette DNA of Example 4 as a template. Second PCR was carried out using F4-2 and the cassette 
primer C2 with 1 uJ of the reaction mixture as a template. The PCR was carried out in a reaction mixture of the same 

10 composition as that used in Example 5(1 ) using Pf u DNA polymerase as an enzyme in 30 cycles for the first PCR and 
25 cycles for the second PCR, wherein one cycle comprises a process consisting of at 94°C (30 seconds) - 45°C (30 
seconds) - 72°C (2 minutes). An amplified DNA fragment of about 1,100 bp by this reaction was subcloned into plasmid 
vector pUC119, and a part of its nucleotide sequence was determined by the dideoxy method using M4 and RV primers 
(manufactured by Takara Shuzo Co., Ltd.). On the basis of the sequence determined, the primers F4S1 and F4S2. of 

15 which nucleotide sequences are shown in SEQ ID NOs: 30 and 31. respectively, in Sequence Listing, were then syn- 
thesized. PCR was carried out using these F4S1 and F4S2 primers with the oosmid DNA prepared in Example 1 as a 
template, whereby selecting cosmid clones carrying the F4 gene. The PCR was carried out in a reaction mixture of the 
same composition as that used in Example 5(1) using Pfu DNA polymerase as an enzyme in 30 cycles, wherein one 
cycle comprises a process consisting of at 94°C (30 seconds) - 55°C (30 seconds) - 72°C (1 minute). As a result, it was 

so found that Cosmid Clone Nos. 1 6. 26. 88. 11 2, 250, 269. 427 and 451 carry the F4 gene. 

(2) Subcloning of F4 Gene 

[0103] PCR was carried out using 20 pmol each of F4S2 and the cassette primer C2 with 1 uJ of the Xba\ cassette 
25 DNA of Example 4 as a template. The PCR was earned out in a reaction mixture of the same composition as that used 
in Example 5(1) using Pfu DNA polymerase as an enzyme in 50 cycles, wherein one cycle comprises a process con- 
sisting of at 94°C (30 seconds) - 55°C (30 seconds) - 72°C (3 minutes). As a result, a DNA fragment of about 700 bp 
was amplified with F4S2 and the cassette primer C2. Also, PCR was carried out under the same conditions using F4- 
2 and the cassette primer C2 with Hind\\\ cassette DNA as a template. As a result, a DNA fragment of about 1,100 bp 
30 was amplified. These findings suggested that the F4 gene is present in an XbaVH'md\\\ fragment of about 1 .6 kh. With 
this in mind, Cosmid No. 16 was digested with Xba\ and Hind\\\, and a DNA fragment of about 1 .6 kb was cut out. and 
each was subcloned into pTV1 18N vector. PCR was earned out using the F4S1 and F4S2 primers with each recom- 
binant plasmid obtained as a template, in order to examine for the presence of the F4 gene. As a result a plasmid har- 
boring a 1 .6 kb Xba\-Hind\\\ fragment carrying the F4 gene was obtained, and this plasmid was named plasmid pF4-1- 
35 4. Also, this plasmid was digested with the restriction enzymes A/col, EooRI. BamH\ t Pst\ t $ac\ and A/del. As a result, 
it was found that none of these sites were present in the above plasmid or DNA insert. 

(3) Determination of Nucleotide Sequence of DNA Fragment Carrying F4 Gene 

40 [0104] The nucleotide sequence of the DNA insert in the above plasmid pF4-1-4 was determined by the dideoxy 
method. 

[0105] A sequence of 1,012 bp of the nucleotide sequence determined is shown in SEQ ID NO: 32 in Sequence 
Listing. As a result of analyzing the nucleotide sequence, there was found an open reading frame having the N-terminal 
amino acid sequence of F4. The nucleotide sequence of this open reading frame is shown in SEQ ID NO: 33 in 
45 Sequence Listing, and the amino acid sequence of the F4 translation product as deduced from the nucleotide sequence 
is shown in SEQ ID NO: 34 in Sequence Listing, respectively. This amino acid sequence was searched for homology to 
the amino acid sequences of known proteins, and as a result, the homologous proteins were not found. 

(4) Construction of Plasmid for F4 Expression 

50 

[0106] PCR was carried out in a reaction mixture of the same composition as that used in Example 5(1) with Pfu 
DNA polymerase using the primer F4NNd, of which nucleotide sequence is shown in SEQ ID NO: 35 in Sequence List- 
ing, and the primer F4CEc, of which nucleotide sequence is shown in SEQ ID NO: 36 in Sequence Listing, with the plas- 
mid pF4-1 -4 described in Example 7(3) as a template. The reaction conditions are shown below. Using 1 ng of template 
55 DNA and 20 pmol each of the two primers, the reaction was carried out in 25 cycles, wherein one cycle comprises a 
process consisting of at 94°C (30 seconds) - 55°C (30 seconds) - 72°C (2 minutes). An amplified DNA fragment of 
about 450 bp was digested with Afcfel and EcoRI (both manufactured by Takara Shuzo Co., Ltd.), and the DNA fragment 
obtained was inserted between the Nde\ and EcoRI sites of plasmid vector pTV1 1 9Nd mentioned above to prepare the 
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plasmid pF4Nd-6. Furthermore, the nucleotide sequence of the DNA insert in the plasmid was determined by the dide- 
oxy method. It was confirmed that there is no mutation caused by PCR. 

(5) Preparation of Purified F4 Authentic Sample 

5 

[0107] Escherichia coli JM1 09/p4Nd-6, Escherichia coli JM109 transformed with the plasmid pF4Nd-6 obtained in 
Example 7(4), was cultured for 16 hours in 2 liters of LB medium containing 100 ug/ml ampiciilin. After harvesting, cells 
were suspended in 33.4 ml of sonication buffer, and 28 ml of a heat-treated supernatant was obtained in the same man- 
ner as Example 2(1). Next, this solution was applied to RESOURCE Q column, previously equilibrated with buffer D, 
10 and the applied solution was chromatographed using FPLC system. The elution was carried out on a linear concentra- 
tion gradient of 0 to 500 mM NaCI. F4 was eluted at a concentration of 325 mM NaCI. 

[0108] Three milliliters of the solution obtained by collecting the F4 fraction was subjected to exchange with buffer 
D containing 150 mM NaCI using PD-10 column, and 6.9 ml of the solution was applied to HiTrap Heparin column, pre- 
viously equilibrated with the same buffer. F4 was not adsorbed to HiTrap Heparin column, and (NH 4 ) 2 S0 4 was added 

15 to 7.2 ml of the F4 fraction flowed through the column so as to have a final concentration of 1 M. This solution was 
applied to HiTrap Phenyl column (manufactured by Pharmacia), previously equilibrated with buffer D containing 1 M 
(NH 4 ) 2 S0 4 . Using FPLC system, the column was washed with each of 1 M and 0.5 M (NH 4 ) 2 S0 4l and thereafter F4 
was eluted with buffer D. Five milliliters of this fraction was concentrated using Centricon-10, and 76 uJ of the concen- 
trate obtained was applied to Superdex 200 gel filtration column, previously equilibrated with 50 mM Tris-HCI buffer, pH 

20 8.0, containing 2 mM 2-mercaptoethanol and 75 mM NaCI. As a result of the elution with the same buffer, F4 was eluted 
at a position corresponding to a molecular weight of about 39 kilodaltons. This molecular weight corresponds to the 
case where F4 has formed a dimer or trimer. 

Example 8 

25 

(1) Selection of Cosmid Clones Carrying F7 Gene 

[01 09] On the basis of the N-terminal amino acid sequence of F7 obtained in Example 3, the primers F7-1 and F7- 
2, of which nucleotide sequences are shown in SEQ ID NOs: 37 and 38, respectively, in Sequence Listing, were syn- 

30 thesized. First PCR was carried out using 100 pmol of F7-1 and 20 pmol of the cassette primer C1 with 1 uJ of the 
Hind\\\ cassette DNA prepared in Example 4 as a template. Second PCR was carried out using 100 pmol of the primer 
F7-2 and 20 pmol of the cassette primer C2 with 1 uJ of the reaction mixture obtained as above as a template. The PCR 
was carried out using the same reaction mixture composition and reaction conditions as those used in Example 6(1). 
An amplified DNA fragment of about 830 bp was subdoned into plasmid vector pUCl 19, and its nucleotide sequence 

35 was determined. On the basis of the sequence determined, the primers F7S1 and F7S2, of which nucleotide 
sequences are shown in SEQ ID NOs: 39 and 40, respectively, in Sequence Listing, were then synthesized. PCR was 
carried out using these primers with the cosmid DNA described in Example 1 as a template, whereby selecting cosmid 
clones carrying the F7 gene. The PCR was carried out in a reaction mixture of the same composition as that used in 
Example 5(1) using Pfu DNA polymerase as an enzyme in 30 cycles, wherein one cycle comprises a process consisting 

40 of at 94°C (30 seconds) - 55°C (30 seconds) - 72°C (3 minutes). As a result, there was found that Cosmid Clone Nos. 
15, 96. 1 14, 167, 277, 348, 386, 400, 419, 456, 457 and 484 carry the F7 gene. 

(2) Subcloning of F7 Gene 

45 [01 1 0] PCR was carried out using 20 pmol each of F7S2 and the cassette primer C2 with 1 uJ of the Hind\\\ cassette 
DNA prepared in Example 4 as a template. The PCR was carried out in a reaction mixture of the same composition as 
that used in Example 5(1) using Pfu DNA polymerase as an enzyme in 50 cycles, wherein one cycle comprises a proc- 
ess consisting of at 94°C (30 seconds) - 55°C (30 seconds) • 72°C (3 minutes). As a result, a fragment of about 900 bp 
was amplified. From this result, together with the result of amplification using F7-2 of Example 8(1) and the cassette 

so primer C2, the presence of the F7 gene in a Hind\\\ fragment of about 1 .0 kb was anticipated. With this in mind, No. 1 5, 
randomly selected from the cosmids carrying that gene, was digested with Hind\\\, and a DNA fragment of around 1 .0 
kb was cut out, and each was subdoned into plasmid vector pTV1 18N. PCR was carried out using the F7S1 and F7S2 
primers with each recombinant plasmid obtained as a template, to examine for the presence of the F7 gene, and as a 
result, H was found that a Hind\\\ fragment of 1 kb carries the F7 gene. A plasmid in which the F7 gene in this DNA frag- 

55 ment was ligated to downstream of the iac promoter of pTV1 1 8N vector was named pF7-HH-1 8, and a plasmid in which 
the F7 gene was ligated in the opposite orientation was named pF7-1-8. Also, a restriction endonuclease map was pre- 
pared for the DNA insert contained in this plasmid, and the map as shown in Figure 6 was obtained. 
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(3) Determination of Nucleotide Sequence of DNA Fragment Carrying F7 Gene 

[01 1 1 ] There was determined by the dideoxy method the nucleotide sequence of each insert in the above two kinds 
of plasmids, each insert in the plasmids being prepared by cutting out the 8amH\-Hind\\\, Nde\-Hind\\\. Hind\\\-Nde\ 

s and Hind\\\-BamH\ fragments from the above two kinds of plasmids. and subcloning the fragments into plasmid vector 
pTV1 19Nd. A sequence of 989 bp of the nucleotide sequence of the DNA insert of the above plasmid, determined on 
the basis of these overall results, is shown in SEQ ID NO: 41 in Sequence Listing. As a result of analyzing the nucle- 
otide sequence, there was found an open reading frame containing the N-terminal amino acid sequence of F7. The 
nucleotide sequence of this open reading frame is shown in SEQ ID NO: 2 in Sequence Listing, and the amino acid 

10 sequence of the F7 translation product as deduced from the nucleotide sequence is shown in SEQ ID NO: 1 in 
Sequence Listing. This amino acid sequence was searched for homology to the amino acid sequences of known pro- 
teins, and as a result, it was found that the amino acid sequence was homologous to the proliferating cell nuclear anti- 
gen (PCNA) involved in the DNA replication in eukaryotes [EMBOJ.. 11, 5111-5120 (1995); Nucleic Acids Research, 
18, 261-265 (1990); Proc. Natl. Acad. Sci. USA, 84, 1575-1579 (1987)]. The homology to the proteins described in the 

15 individual references were 24, 28 and 24%, respectively. 

(4) Preparation of Purified F7 Authentic Sample 

[0112] Escherichia coli JM109/&F7-HH-18, Escherichia coli JM109 transformed with the plasmid pF7-HH-18 
20 obtained in Example 8(2), was cultured for 16 hours in 2 liters of LB medium containing 100 ug/ml ampicillin. After har- 
vesting, cells were suspended in 45 ml of sonication buffer, and 41.9 ml of a heat-treated supernatant was obtained in 
the same manner as Example 2(1). Next, this solution was thrice subjected to 2-hour dialysis against 2 liters of buffer 
A as a dialysate. After dialysis, 36 ml of the enzyme solution was applied to RESOURCE Q column, previously equili- 
brated with buffer A, and the applied solution was chromatographed using FPLC system. Tlie elution was carried out 
25 on a linear concentration gradient of 0 to 500 mM NaCI. As a result. F7 was eluted at 340 mM NaCI. 

[01 1 3] Ten milliliters of the solution obtained by collecting the F7 fraction was concentrated using Centrif low CF-50, 
and thereafter subjected to exchange with buffer A containing 1 M (NH4) 2 S0 4 using PD-10 column, and 3.5 ml of the 
solution obtained was applied to HiTrap Phenyl column, previously equilibrated with the same buffer. Using FPLC sys- 
tem, the column was sequentially washed with 1 M and 0.5 M (NHUfeSO* and thereafter F7 was eluted with buffer A. 
30 Four milliliters of this fraction was concentrated using Centricon-1 0, and 80 \i\ of this concentrate was applied to Super- 
dex 200 gel filtration column, previously equilibrated with 50 mM potassium phosphate buffer (pH 6.5) containing 2 mM 
2-mercaptoethanol and 75 mM NaCI. As a result of elution with the same buffer, F7 was eluted at a position correspond- 
ing to a molecular weight of about 99 kilodaltons. This molecular weight corresponds to the case where F7 has formed 
a timer. 

35 

(5) Effects of F7 on Primer Extension Reactions 

[0114] In order to examine for the effects of F7 on the primer extension reactions to various polymerases, the activ- 
ities of Pfu polymerase C, Pfu DNA polymerase (a-type DNA polymerase, manufactured by STRATAGENE) and Pyro- 
40 dictium occufo/m-derived Poc DNA polymerases I and II [Poc DNA polymerases I and II, J. Bacterid., 177, 2164-2177 
(1995)] were compared with regard to the presence or absence of the addition of F7. 

[01 1 5] Determination of DNA polymerase activities were carried out with reference to the Pfu polymerase C activity 
determination described in Example 2(1). The substrate used was the constructs (M13-HT primer) as prepared by 
annealing the HT primer, a synthetic oligonucleotide having 45 bases, to M13 phage single-stranded DNA (M13mp18 
45 ssDNA, manufactured by Takara Shuzo Co., Ltd.). The nucleotide sequence of the HT primer is shown in SEQ ID NO: 
42 in Sequence Listing. 

[011 6] Concretely, a reaction mixture [20 mM Tris-HCI, pH 7.7, 15 mM MgCI 2 , 2 mM 2-mercaptoethanol, 0.01 ng/uJ 
M13-HT primer, 40 \M each of dATP, dCTP, dOTP and dTTP, 60 nM [ 3 H]k1TTP (manufactured by Amersham)] making 
up a final volume of 50 nl and containing each DNA polymerase listed in Table 2 and F7 was prepared and reacted at 
so 75°C for 5 minutes. After the reaction mixture was cooled with ice to stop the reaction, a 40 *il portion was spotted onto 
DE paper (manufactured by Whatman) and washed 5 times with 5% Na 2 HP0 4 . and thereafter the remaining radioac- 
tivity on the DE paper was determined using a liquid scintillation counter. 

[0117] As shown in Table 2, for all the DNA polymerases used, an increase in DNA polymerase activity due to the 
addition of F7 was observed. 

55 
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Table 2 





DNA Polymerase 


F7 


Enzyme Activity (cpm) 


5 


Rlank 1 
Dial irv i 






61 




DICII UV £. 




10pmol 


35 




Pfn PolvmnraRA 






888 


m 

IV 


Pfu Polymerase C 

1 1(4 1 vljlllvUlwC V/ 


(25fmol) 


5pmol 


2897 




Pfu Polymerase C 


(25fmol) 


10pmol 


3175 




Pfu DNA Polymerase 


(120fmol) 




907 


15 


Pfu DNA Polymerase 


(120fmol) 


0.48pmol 


1363 


Pfu DNA Polymerase 


(120fmol) 


4.8pmol 


1637 




Poc DNA Polymerase 1 


(74pmol) 




62 




Poc DNA Polymerase 1 


(74pmoI) 


10pmol 


69 


20 


Poc DNA Polymerase II 


(6.0pmol) 




433 




Poc DNA Polymerase II 


(6.0pmol) 


10pmol 


1443 


25 


Note: In the table, the amount of Pfu polymerase C is the amount of a protein comprising one 
molecule each of the two DNA polymerase-constituting proteins, and the amount of F7 is the 
amount as a trimer protein. 



[0118] Primer extension activity was further studied in detail. The M13-HT primer, previously labeled at the 5-ter- 
minus of the primer using [r-^PJ-ATP (manufactured by Amersham) and T4 polynucleotide kinase (manufactured by 

30 Takara Shuzo Co., Ltd.), was used as a substrate. 

[01 19] A 1 nl sample solution containing each of the following samples was prepared: 1) 18 fmol of Pfu polymerase 
C, 2) 18 fmol of Pfu polymerase C + 2 pmol of F7, 3) 0.24 pmol of Pfu DNA polymerase, 4) 0.24 pmol of Pfu DNA 
polymerase + 0.78 pmol of F7. To each sample solution, 9 pi of a reaction mixture [20 mM Tris-HCI (pH 9.0), 15 mM 
MgCI 2 , 2 mM 2-mercaptoethanol, 40 uM each of dATP, dGTP, dCTP and dTTP] containing 0.01 jig/jil 32 P-labeled M13- 

35 HT primer was added, and a reaction was carried out at 75°C for 2.5 minutes or 5 minutes. After termination of the reac- 
tion, the reaction mixture was cooled with ice to stop the reaction, and 1 uJ of 200 mM EDTA and 5.5 uJ of a reaction 
stop solution (95% fbrmamide, 20 mM EDTA, 0.05% bromophenol blue, 0.05% xylene cyanol) were added, and thermal 
denaturation treatment was carried out at 95°C for 5 minutes. After 1 .6 ul of this reaction mixture was electrophoresed 
using 6% polyacrylamide gel containing 8 M urea, an autoradiogram was prepared. The autoradiogram obtained is 

40 shown in Figure 7. 

[0120] In the figure, Pfu-C and pfu show the results obtained with Pfu polymerase C and Pfu DNA polymerase, 
respectively, and 2.5 and 5 show the respective reaction time (minutes). In addition, the symbols - and + in the figure 
show the results obtained with the reaction mixture in the absence and presence of F7. respectively. Further, the lanes 
on both ends of the figure show the results of electrophoresis of JL-EcoT14l digest (manufactured by Takara Shuzo Co.. 

45 Ltd.). previously labeled at the 5'-terminus using [r^PJ-ATP (manufactured by Amersham) and T4 polynucleotide 
kinase (manufactured by Takara Shuzo Co., Ltd.). and were used to deduce the lengths of the extension products. 
[01 21 ] As shown in Figure 7, when F7 is not added, in Pfu polymerase C. DNAs of about 300 to 600 bases are the 
major extension products obtained, whereas when F7 is added, extension products of low chain length decreases and 
the ratio of extension products exceeding 1.000 bases increases. Also in Pfu DNA polymerase, the chain length of 

50 extension products was markedly extended by the addition of F7. It was thus elucidated that F7 increases the primer 
extension rates of both Pfu polymerase C and Pfu DNA polymerase. 

[0122] Next, in order to analyze primer extension reaction products of higher molecular weights, the primer exten- 
sion reaction products of Pfu polymerase C and Pfu DNA polymerase with the 32 P-labeled M13-HT primer as a sub- 
strate were analyzed by alkaline agarose gel electrophoresis. To 1 \i\ of a solution of each of samples 1) to 4) above, 9 
55 ul of a reaction mixture (20 mM Tris-HCI, pH 9.0. 1 5 mM MgCI 2 , 2 mM 2-mercaptoethanol, 40 uM each of dATP. dGTP, 
dCTP and dTTP, 84 nM [a- 32 P]-dCTP) was added so as to have a final concentration of 0.01 ug/jil M13-HT primer, and 
a reaction was carried out at 75°C for 2.5 minutes. After termination of the reaction, to the ice cooled reaction mixture, 
1 .1 1 ul of 200 mM EDTA, 1 .23 |d of 500 mM NaOH and 2.47 ul of 6-fbld concentrated loading buffer (0.125% bromophe- 
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nol blue. 0.125% xylene cyanol. 9% glycerol) were sequentially added. After 6 ul of this mixture was eiectrophoresed 
using 0.5% alkaline agarose get, an autoradiogram was prepared. The autoradiogram obtained is shown in Figure 8. 
[0123] In the figure, Pfu-C and pfu show the results obtained with Pfu polymerase C and Pfu DNA polymerase, 
respectively, and the symbols - and + in the figure show the results obtained without or with addition of F7, respectively. 

5 Further, in the figure, Lane M is for the X- EcoT1 41 digest, previously labeled at one end in the same manner as above. 
As shown in Figure 8. in the case of Pfu polymerase C, a weak extension product signal was observed near 2.5 kb in 
the absence of F7, whereas a 7.3 kb signal completely encircling M13 ssDNA was observed in the presence of F7. In 
addition, in the case of Pfu DNA polymerase, a signal was observed near 2.7 kb in the presence of F7, whereas no sig- 
nal was observed in the absence of F7. These findings demonstrate that F7 enhances the extension reactions of the 

io two DNA polymerases. 

ExamplQ 9 

(1) Selection of Cosmid Clones Carrying Gene Encoding Homologs of RFC Small Subunit 

15 

[0124] Regarding the amino acid sequence of the RFC small subunit of Methanococcus jannaschii [Science, 273, 
1 058-1 073 (1996)], homology to the amino acid sequences of RFC (RF-C) small subunits derived from other organisms 
was examined. On the basis of the amino acid sequences of regions highly conserved thereamong, the primers RF-F1 , 
RF-F3, RF-F4, RF-R1 , RF-R2, RF-R3 and RF-R4 for searching the gene encoding the RFC small subunit were synthe- 

20 sized. The nucleotide sequences of these primers are shown in SEQ ID IMOs: 43 to 49, respectively, in Sequence List- 
ing. PCR was carried out using various combinations of these primers with Pyrococcus furiosus genomic DNA as a 
template, whereby searching for the gene encoding the RFC small subunit. The PCR was carried out in a reaction mix- 
ture of the same composition as that used in Example 5(1) using Pfu DNA polymerase, and using 0.25 fig of template 
DNA and 100 pmol of each primer. When first PCR was carried out using RF-F1 and RF-R4, second PCR was carried 

25 out using RF-F4 and RF-R4, or RF-F1 and RF-R1 , with 1 ul of the reaction mixture as a template. When first PCR was 
carried out using RF-F1 and RF-R3, second PCR was carried out using RF-F3 and RF-R2 with 1 ul of the reaction mix- 
ture as a template. Amplified DNA fragments of about 240 bp, about 140 bp and about 140 bp, respectively, were 
obtained. Each of these DNA fragments was subcloned into plasmid vector pUC1 19, and its nucleotide sequence was 
determined. On the basis of the sequences determined, the primers RF-S1 , RF-S2, RF-S3, RF-S4 and RF-S5, of which 

30 nucleotide sequences are shown in SEQ ID NOs: 50 to 54, respectively, in Sequence Listing, were then synthesized. 
PCR was carried out using these RF-S1 and RF-S3 primers with the cosmid DNA prepared in Example 1 as a template, 
whereby selecting cosmid clones assumed to carry the gene encoding homologs of the RFC small subunit The PCR 
was carried out using the TaKaRa PCR amplification kit in 25 cycles, wherein one cycle comprises a process consisting 
of at 94°C (30 seconds) - 55°C (30 seconds) - 72°C (2 minutes). As a result, there was found that Cosmid Gone Nos. 

35 254, 310, 313, 377 and 458 carry the desired gene (PFU-RFC gene). 

(2) Subcloning of PFU-RFC Gene 

[0125] PCR was carried out using 100 pmol of RF-S1 and 20 pmol of the cassette primer C2, or 100 pmol of RF- 
40 S2 and 20 pmol of the cassette primer C2, with 1 ug each of the Xba\ and EcoRI cassette DNAs prepared in Example 
4 as a template. The PCR was carried out in a reaction mixture of the same composition as that used in Example 6(1) 
using the Pfu polymerase C enzyme in 50 cycles, wherein one cycle comprises a process consisting of at 94°C (30 sec- 
onds) - 55°C (30 seconds) - 72°C (3 minutes). As a result, a DNA fragment of about 2 kb was amplified by RF-S1 and 
the cassette primer C2 when the Xba\ cassette was used as a template, and a DNA fragment of about 1 .5 kb was ampli- 
45 f ied by RF-S2 and the cassette primer C2 when the EcoRI cassette was used as a template. Each of these DNA frag- 
ments was subcloned into plasmid vector pUC1 19, and the recombinant plasmids obtained were named pRFSXSl -26 
and pRFSES2-8. Restriction endonuclease maps of these plasmids were prepared, and as a result, it was anticipated 
that neither Nde\ nor BamHI site is present in the PFU-RFC gene. 

[0126] The cosmids of the five clones mentioned in (1) above were each digested with Nde\ and BaroHl, and the 
so eiectrophoretic patterns were examined. As a result, a common band was observed near 5 kb. Anticipating the pres- 
ence of the PFU-RFC gene in this DNA fragment, an Nde\-BamH\ fragment of about 5 kb from Clone No. 254 was cut 
out, and each was subcloned into pTV119Nd vector mentioned above. A transformant formed with the recombinant 
plasmid obtained was examined for the presence PFU-RFC gene by PCR using the RF-S1 and RF-S3 primers. As a 
result, there was found that this Nde\-BamH\ fragment carry the PFU-RFC gene. Therefore, the plasmid resulting from 
55 insertion of this Nde\-BamH\ fragment into pTV1 1 9Nd vector was named plasmid pRFS254NdB. In addition, a restric- 
tion endonuclease map of this plasmid was prepared, and the map as shown in Figure 9 was obtained. 
[0127] On the basis of the restriction endonuclease map shown in Figure 9, various fragments were cut out from 
pRFS254NdB by the method described below, and each was subcloned into pTV1 18N vector (manufactured by Takara 
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Shuzo Co., Ltd.). First, a DNA fragment of about 500 bp obtained by digesting pRFS254NdB with Xba\ and Sac\, a DNA 
fragment of about 2 kb obtained by digesting with Xba\ and Nco\ t and a DNA fragment of about 1.1 kb obtained by 
digesting with Nco\ and BamH\ was prepared, respectively, and each was mixed with pTV1 18N, previously linearized 
with Sad and BamH\, for ligation, whereby constructing a recombinant plasmid. This plasmid was named 
5 pRFS254SXNB. 

(3) Determination of Nucleotide Sequence of DNA Fragment Carrying PFU-RFC Gene 

[0128] The nucleotide sequence of the DNA insert in the plasmid pRFS254NdB obtained in Example 9(2) was 
10 determined by the dideoxy method. A sequence of 3,620 base pairs of the nucleotide sequence determined is shown 
in SEQ ID NO: 55 in Sequence Listing. The amino acid sequence of the protein encoded by this nucleotide sequence 
was deduced. As a result of comparing this amino acid sequence with those of known RFC small subunits, there was 
anticipated the presence of one intein in the amino acid sequence of PFU-RFC. This intein is encoded by Nos. 721 to 
2295 of SEQ ID NO: 55 in Sequence Listing. 

15 

(4) Construction of Intein-Eliminated PFU-RFC Expression Plasmid 

[0129] On the basis of the nucleotide sequence determined in Example 9(3). and the amino acid sequence of a 
known RFC small subunit and the nucleotide sequence of the gene encoding the subunit, the primers RF-CBAI and RF- 

20 CAAI, of which nucleotide sequences are shown in SEQ ID NOs: 56 and 57 in Sequence Listing, were synthesized. 
Inverse PCR was carried out using these two primers, each of which 5'-terminus was previously phosphorylated, with 
the above plasmid pRFS254SXNB as a template. For inverse PCR, TaKaRa Ex Taq was used to prepare 100 pi of a 
reaction mixture in accordance with the instructions for the enzyme. To this reaction mixture added with 15 ng of the 
plasmid pRFS254SXNB and 20 pmol each of the primers, the reaction was carried out in 30 cycles, wherein one cycle 

25 comprises a process consisting of at 94°C (30 seconds) - 55°C (30 seconds) - 72°C (3 minutes). An amplified DNA frag- 
ment obtained by the inverse PCR was blunt-ended using DNA blunting kit (manufactured by Takara Shuzo Co., Ltd.), 
and thereafter subjected to self -ligation, whereby constructing a plasmid, which was named the plasmid pRFS254ISAI. 
[0130] Furthermore, an Xba\-Nco\ fragment of about 400 bp isolated after digestion of the plasmid with Xba\ and 
A/col was mixed with and an Xba\-Sac\ fragment of about 500 bp and an Nco\-BamH\ fragment of about 1 .1 kb, each 

30 isolated from the plasmid pRFS254NdB obtained in Example 9(2), and the mixed fragments were subcloned between 
the BamH\ and Sacl sites of plasmid vector pTV118N. The recombinant plasmid obtained as described above was 
named pRFS254SNc. Escherichia coil JM109 transformed with the plasmid was named Escherichia coli 
JM109/&RFS254SNC. It was found that the transfbrmant expresses PFU-RFC at high level. 

35 (5) Determination of Nucleotide Sequence of Gene Encoding PFU-RFC Without Carrying Intein 

[01 31 ] An Xba\- Nco\ fragment of about 400 bp derived from the plasmid pRFS254SXNB obtained in Example 9(4) 
was subcloned into plasmid vector pTV1 1 8N, and the nucleotide sequence of the DNA insert was determined, whereby 
the nucleotide sequence encoding the boundary portion of the intein eliminated was confirmed. From this result and the 
40 results of Example 9(3), the nucleotide sequence of the gene encoding PFU-RFC without carrying intein was deter- 
mined. The nucleotide sequence of the open reading frame encoding PFU-RFC without carrying intein obtained as 
described above and the amino acid sequence of PFU-RFC deduced from the nucleotide sequence are shown in SEQ 
ID NOs: 4 and 3, respectively, in Sequence Listing. 

45 (6) Preparation of Purified PFU-RFC Authentic Sample 

[0132] Escherichia coli JM109/fc>RFS254Nc obtained in Example 9(4) was cultured for 16 hours in 2 liters of LB 
medium containing 100 *ig/ml ampicillin. After harvesting, cells were suspended in 44.1 ml of sonication buffer, and 35.2 
ml of a heat-treated supernatant was obtained in the same manner as Example 2(1). Next, this solution was applied to 
so RESOURCE Q column, previously equilibrated with buffer D, and the applied solution was chromatographed using 
FPLC system. PFU-RFC was flowed through RESOURCE Q column. 

[01 33] Thirty-five milliliters of the flow-through PFU-RFC fraction was applied to RESOURSE S column (manufac- 
tured by Pharmacia), previously equilibrated with buffer D. Using FPLC system, the elution was carried out on a linear 
concentration gradient of 0 to 500 mM NaCI to yield a PFU-RFC fraction eluted at 1 70 mM NaCI. 2.9 ml Of this fraction 
55 was concentrated using Centricon- 1 0, and 1 05 p\ of the concentrate obtained was applied to Superdex 200 gel filtration 
column, previously equilibrated with 50 mM Tris-HCI buffer, pH 8.0, containing 2 mM 2-mercaptoethanol and 75 mM 
NaCI. The elution was carried out with the same buffer, and as a result, PFU-RFC was eluted at a position correspond- 
ing to a molecular weight of about 1 50 kilodaltons. This molecular weight corresponds to the case where PFU-RFC has 
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formed a tetramer. 

(7) Effects of PFU-RFC on Primer Extension Reaction 

5 [01 34] The effects of PFU-RFC andF7 on the primer extension reaction by Pf u polymerase C were examined in the 
same manner as Example 8(5). The results are shown in Table 3. As shown in Table 3, PFU-RFC slightly enhanced the 
activity of Pfu polymerase C. Furthermore, in the case where PFU-RFC was added simultaneously with F7, the 
enhanced activity more than doubled than the case where F7 was added alone. 

w 

Table 3 



Pfu Polymerase C 


F7 


PFU-RFC 


Enzyme Activity (cpm) 








100 


90fmol 






366 


90fmol 


9.6 pmol 




2743 


90fmol 




356 fmol 


463 


90fmol 


9.6 pmol 


356 fmol 


8740 


Note: In the table, the amount of Pfu polymerase C is the amount as a protein comprising one mole- 
cule each of the two DNA polymerase-constituting proteins, and the amounts of F7 and PFU-RFC are 
the amounts as a trimer and tetramer proteins, respectively. 
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Example iq 

(1) Preparation of Anti-Pfu DNA Polymerase Antibody 

30 [0135] Twelve milliliters (30,000 units) of cloned Pfu DNA polymerase (manufactured by STRATAGENE) was con- 
centrated by ultrafiltration using Centricon-10, and thereafter 0.1 ml of the concentrate obtained was applied to Super- 
dex 200 gel filtration column (manufactured by Pharmacia), previously equilibrated with 50 mM Tris-HQ (pH 8.0) 
containing 2 mM 2-mercaptoethanol and 75 mM NaCI. The elution was carried out with the same buffer, and a Pfu DNA 
polymerase fraction eluted at a position corresponding to a molecular weight of about 76 kilodaltons was recovered. 

35 After 0.8 ml of this fraction was concentrated using Centricon-10. this concentrate was used as an antigen to prepare 
an anti-Pfu DNA polymerase polyclonal antibody. The above concentrate was diluted with physiological saline so as to 
have a Pfu DNA polymerase concentration of 2 mg/ml, and the diluted solution was emulsified with an equal volume of 
Freund's complete adjuvant. This emulsion was subcutaneously injected to rabbits at 250 \i\ per injection tour times at 
3-week intervals. Ten days after final immunization, whole blood was extracted. After allowing to stand at room temper- 

ao ature for 60 minutes, the extracted blood was centrHuged to yield 60 ml of an antiserum containing the anti-Pfu DNA 
polymerase polyclonal antibody. To 26 ml of this antiserum, 26 ml of a saturated solution of ammonium sulfate was 
added, and the mixture was gently stirred at 4°C for 1 hour and 45 minutes, and subsequently centrifuged. The precip- 
itate was suspended in 5 ml of 20 mM sodium phosphate buffer (pH 7.0) and desalted using PD-10 column (manufac- 
tured by Pharmacia), previously equilibrated with the same buffer. Ten milliliters of this solution was applied to Protein 

45 A column (manufactured by Pharmacia), previously equilibrated with 20 mM sodium phosphate buffer (pH 7.0). After 
the column was washed with the same buffer, the elution was carried out with 0.1 M sodium citrate buffer (pH 3.0). The 
eluted fraction containing the anti-Pfu DNA polymerase polyclonal antibody was neutralized with 1 M Tris-HCI, pH 9.0. 
and thereafter the mixture was concentrated using Centriflow CF-50 and subjected to exchange with coupling buffer 
(0.5 M NaCI. 0.2 M NaHC0 3 , pH 8.3) using PD-10 column to prepare a solution containing the anti-Pfu DNA polymer- 

so ase antibody. 

(2) Preparation of Anti-Pfu DNA Polymerase Antibody Column 

[0136] HiTrap NHS-activated column (manufactured by Pharmacia) was washed with 6 ml of 1 mM HQ, and there- 
55 after 0.9 ml of the above anti-Pfu DNA polymerase polyclonal antibody solution (containing 4.5 mg equivalent of the 
anti-Pfu DNA polymerase antibody) was applied. Subsequently, an anti-Pfu DNA polymerase antibody column was pre- 
pared in the same manner as Example 2(3). 



25 



EP 0 997 530 A1 



(3) Confirmation of Formation of Complex of Pfu DNA Polymerase and F7 Using Anti-Pfu DNA Polymerase Antibody 
Column 

[0137J Pyrococcus furiosus DSM3638 was cultured in the same manner as the method described in Example 1 to 
5 yield cells in 9 liters of a culture medium. These cells were suspended in 33 ml of buffer C (50 mM Tris-HCI, pH 8.0, 0.1 
mM ATP) containing 2 mM PMSF, and the resulting suspension was treated with an ultrasonic disrupter. The disrupted 
solution obtained was centrifuged at 12,000 rpm for 1 0 minutes, and 44 ml of the supernatant obtained was applied to 
the anti-Pfu DNA polymerase antibody column, previously equilibrated with buffer C. The column was washed with 
buffer C containing 0.1 M NaCI, and thereafter the Pfu DNA polymerase complex was eluted with elution buffer (50 mM 
10 Tris-HCI, pH 8.0, 8 M urea). This eluate was subjected to SDS-PAGE (12.5% polyacrylamide gel; 25 mM Tris-HCI, 192 
mM glycine, 0.1% SDS, pH 8.4 used as electrophoresis buffer). The gel after electrophoresis was stained with Coomas- 
sie brilliant blue R-250 by a conventional method. As a result, as shown in Figure 10, besides the band of Pfu DNA 
polymerase, a band was detected at a position corresponding to the above F7. 

[0138] With this in mind, a concentrate of this eluate was subjected to SDS-PAGE in the same manner as above, 
is and the gel obtained was subjected to Western blotting using the anti-Pfu DNA polymerase antibody in the same man- 
ner as Example 3(2). From the result of SDS-PAGE shown in Figure 10 and the results of the above Western blotting, 
there was elucidated that the band at a position corresponding to F7 is a protein unreactive with the anti-Pfu DNA 
polymerase antibody. 

[0139] Furthermore, the N-terminal amino acid sequence of the protein of this band was analyzed in the same man- 
20 ner as Example 3(2), and as a result, it was found that this protein is F7. 

(4) Confirmation of Formation of Complex of Pfu DNA Polymerase and F7 Using Gel Filtration Chromatography 

[0140] 1 .2 ml Of the F7 authentic sample obtained in Example 8(4) was subjected to buffer-exchange with 50 mM 
25 Tris-HCI (pH 8.0) containing 2 mM 2-mercaptoethanol and 75 mM NaCI using PD-1 0 column, and thereafter the result- 
ing solution was concentrated to a volume of 50 id using Centricon-10. 

[0141] Ten microliters each of the 0.1 mM Pfu DNA polymerase solution described in Example 10(1), the above 0.1 
mM (calculated as a trimer) F7 solution, and a mixture of 0.1 mM Pfu DNA polymerase and 0.1 mM F7, was heated 
from 60° to 90°C over a period of 30 minute. Each heat-treated solution was applied to Superdex 200 PC3.2/30 gel fil- 

30 tration column (manufactured by Pharmacia), previously equilibrated with 50 mM Tris-HCI buffer, pH 8.0, containing 2 
mM 2-mercaptoethanol and 75 mM NaCI, and the elution was carried out with the same buffer. Pfu DNA polymerase 
and F7 were eluted at positions corresponding to molecular weights of about 76 kilodaltons and about 128 kilodaftons, 
respectively. In the case of the mixture of Pfu DNA polymerase and F7, a main peak corresponding to about 320 kilo- 
daltons and a minor peak corresponding to about 128 kilodaltons were eluted. The fractions with these two peaks were 

as each subjected to SDS-PAGE (12.5% polyacrylamide gel; 25 mM Tris-HCI, 192 mM glycine, 0.1% SDS, pH 8.4 used as 
electrophoresis buffer). The fraction corresponding to about 320 kilodaltons contained Pfu DNA polymerase and F7, 
whereas the fraction corresponding to about 128 kilodaltons contained F7 only. From the above, there was found that a 
complex of Pfu DNA polymerase and F7 is formed. 

40 (5) Extension Activity of Pfu DNA Polymerase-F7 Complex 

[0142] In the gel filtration described in Example 1 0(4), 20 nl each of the eluates obtained by gel filtration of Pfu DNA 
polymerase alone corresponding to about 76 kilodaltons, and of the mixture of Pfu DNA polymerase and F7 corre- 
sponding to 320 kilodaltons, were each collected, and the primer extension activity of each eluate or mixture was deter- 

45 mined by the activity determination method described in Example 8(5) where the non-labeled M13-HT primer was used 
as a substrate. Also, at the same time, incorporation activity was determined by the method described in Example 2(1) 
where an activated DNA was used as a substrate. The results are shown in Figure 1 1 . The ratio of the primer extension 
activity to the incorporation activity for the two fractions was determined such that the ratio of 0.65 was obtained for the 
about 320 kilodalton fraction, and the ratio of 0.29 was obtained for the about 76 kilodalton fraction. Therefore, there 

so was found that the primer extension activity of Pfu DNA polymerase is enhanced by the formation of a complex with F7. 

Example 11 

(1) Selection of Cosmid Clones Carrying Gene Encoding Homologs of RFC Large Subunit 

55 

[0143] Regarding the amino acid sequence of the RFC large subunit of Methanococcus jannaschii [Science, 273, 
1058-1073 (1996)], homology to the amino acid sequences of PFU-RFC small subunits without carrying intein 
described in Example 9 was examined. In reference to the amino acid sequence of a region highly conserved among 
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them, the primer RFLS15 for searching the gene encoding the RFC large subunit was synthesized. The nucleotide 
sequence of the primer RFLS1 5 is shown in SEQ ID NO: 60 in Sequence Listing. PCR was carried out using a combi- 
nation of this primer with the above primer RF-F1 corresponding to a similar amino acid sequence existing in the two 
subunit proteins of RFC with Pyrococcus furiosus genomic DNA as a template. The PCR was carried out using a reac- 

5 tion mixture of the same composition as that used in Example 5(1 ) using Pfu DNA polymerase. 025 ug of template DNA 
and 100 pmol each of primers. Of the two kinds of DNA fragments amplified by this PCR, an amplified DNA fragment 
of about 630 bp, of which size differs from the anticipated size of the amplification product derived from the PFU-RFC 
small subunit gene was isolated. This DNA fragment was subcloned into plasmid vector pUC119, and its nucleotide 
sequence was determined. Thereafter, in reference to the nucleotide sequence determined, the primers RFLS-S3 and 

10 RFLS-S4, of which nucleotide sequences are shown in SEQ ID NOs: 61 and 62 in Sequence Listing, were then synthe- 
sized. 

[0144] PCR was carried out using these two primers with the cosmid DNA prepared in Example 1 as a template, 
whereby selecting cosmid clones assumed to carry the gene encoding homologs of the RFC large subunit. The PCR 
was carried out in a reaction mixture of the same composition as that used in Example 5(1) using Pfu DNA polymerase 

is as an enzyme in 30 cycles, wherein one cycle comprises a process consisting of at 94°C (30 seconds) - 55°C (30 sec- 
onds) - 72°C (2 minutes). As a result, Cosmid Clone Nos. 254, 310, 313. 377 and 458 were found to carry the desired 
gene (PFU-RFCLS gene). These Cosmid Clone Numbers were identical to the above cosmid clones carrying the PFU- 
RFC gene. With this in mind, the nucleotide sequence of the DNA insert in the plasmid pRFS254NdB shown in SEQ ID 
NO: 55 in Sequence Listing was examined, and it was found that a homolog (PFU-RFCLS) of the RFC large subunit 

20 was encoded by the open reading frame starting at No. 31 09 of the sequence immediately downstream of the PFU-RFC 
gene. However, this plasmid pRFS254NdB did not harbor a full length of the PFU-RFCLS gene. 

(2) Subdoning of PFU-RFCLS Gene 

25 [0145] In order to isolate a DNA fragment carrying the full length of the PFU-RFCLS gene, Clone No. 254 above 
was digested with Nhe\, and the various DNA fragments obtained were cut out, and each was subcloned into plasmid 
vector pTV118N (manufactured by Takara Shuzo Co., Ltd.). PCR was carried out using RFLS-S3 and RFLS-S4 as 
primers with each of the recombinant plasmids obtained as a template, in order to examine whether or not the PFU- 
RFCLS gene is present. As a result, an Nhe\ fragment of about 1 1 kb was found to carry the RFLS gene. The plasmid 

30 resulting from insertion of this Nhe\ fragment into pTV1 1 8N was named the plasmid pRFLSNh. In addition, a restriction 
endonuciease map of the DNA insert contained in this plasmid was prepared, and the results as shown in Figure 12 
were obtained. 

[0146] Furthermore, the nucleotide sequence of the DNA insert contained in this plasmid was determined by the 
dideoxy method. Of the nucleotide sequence determined, the nucleotide sequence of the open reading frame portion 
35 encoding PFU-RFCLS is shown in SEQ ID NO: 63 in Sequence Listing. The amino acid sequence of PFU-RFCLS 
deduced from the sequence is shown in SEQ ID NO: 64 in Sequence Listing. 

Example 12 

40 (1) Selection of Cosmid Clones Carrying F5 Gene 

[0147] On the basis of the N-terminal amino acid sequence of F5 obtained in Example 3. the primers F5-1 -1 and 
F5-2, of which nucleotide sequences are shown in SEQ ID NO: 65 and 66, respectively, in Sequence Listing, were syn- 
thesized. First PCR was carried out using 100 pmol each of F5-1-1 and the cassette primer C1 (manufactured by 

45 Takara Shuzo Co., Ltd.) with 1 ul of the Pstl cassette DNA prepared in Example 4 as a template. Second PCR was car- 
ried out using 100 pmol of both F5-2 and the cassette primer C2 (manufactured by Takara Shuzo Co., Ltd.) with 1 nJ of 
the above reaction mixture as a template. This second PCR was carried out using TaKaRa PCR amplification kit (man- 
ufactured by Takara Shuzo Co., Ltd.) in accordance with the instructions attached. An amplified DNA fragment of about 
900 bp was subcloned into plasmid vector pTV1 18N (manufactured by Takara Shuzo Co., Ltd.). The plasmid obtained 

so was named pF5P2, and its nucleotide sequence was determined. Thereafter, on the basis of the sequence determined, 
primers F5S1 and F5S2, of which nucleotide sequences are shown in SEQ ID NOs: 67 and 68, respectively, in 
Sequence Listing, were synthesized. PCR was carried out using these F5S1 and F5S2 with the cosmid DNA described 
in Example 1 as a template, whereby selecting cosmid clones carrying the F5 gene. This PCR was carried out using 
the TaKaRa PCR amplification kit in accordance with the instructions attached. As a result, there were found that Cos- 

55 mid Clone Nos. 15, 96. 114. 167, 277. 348, 386. 400. 419, 456, 457 and 484 carry the F5 gene. These Cosmid Clone 
Numbers were identical to the cosmid clones carrying the F7 gene. With this in mind, the nucleotide sequence shown 
in SEQ ID NO: 41 in Sequence Listing was examined, and it was found that a portion on or after No. 892, which is down- 
stream of the F7 gene on the sequence, carries a first half of the F5 gene. 
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(2) Subcloning of F5 Gene 

[0148] In order to subclone the F5 gene, a restriction endonuclease map for A/col, BamHl Pstl Hind\\\ and Nde\ 
(manufactured by Takara Shuzo Co., Ltd.) in the neighborhood of the F5 gene was prepared using the plasmid pF7-HH- 
5 18 obtained in Example 8 and the above plasmid pF5P2, and the results as shown in Figure 13 were obtained. 

[0149J On the basis of the restriction endonuclease map shown in Figure 13, Cosmid Clone No. 15 was digested 
with Ndel and a fragment of about 900 bp was cut out and subcloned into plasmid vector pTV1 18Nd. As to the recom- 
binant plasmid obtained, a plasmid resulting from insertion of the F5 gene in the orthodox orientation with respect to 
the lac promoter was named pF5NNF-1 . 

10 

(3) Determination of Nucleotide Sequence of DNA Fragment Carrying F5 Gene 

[0150] TTie nucleotide sequence of the DNA insert in the above plasmid pF5NNF-1 was determined by the dideoxy 
method. As a result of analyzing the nucleotide sequence determined, there was found an open reading frame encod- 
15 ing a protein of which N-terminal amino acid sequence is identical to that of F5. The nucleotide sequence of this open 
reading frame is shown in SEQ ID NO: 69 in Sequence Listing, and the amino acid sequence of F5 as deduced from 
the above nucleotide sequence is shown in SEQ ID NO: 70 in Sequence Listing. This amino acid sequence was 
searched for homology to the amino acid sequences of known proteins, and as a result, proteins homologous thereto 
were not found. 

20 

(4) Construction of Plasmid for F5 Expression 

[0151 J PCR was carried out using the primers F5Nco and F5CBam, of which nucleotide sequences are shewn in 
SEQ ID NOs: 71 and 72, respectively, in Sequence Listing, with the above plasmid pF5NNF-1 as a template. The PCR 

25 was carried out in a reaction mixture of the same composition as that used in Example 5(1 ) using Pf u DNA polymerase. 
Using 1 ng of a template DNA and 20 pmol each of both of the primers, the reaction was carried out in 25 cycles, 
wherein one cycle comprises a process consisting of at 94°C (30 seconds) - 55°C (30 seconds) - 72°C (2 minutes). An 
amplified DNA fragment of an about 640 base pairs was digested with Nco\ and BamHl (both manufactured by Takara 
Shuzo Co., Ltd.), and the fragment obtained was ligated with pET1 5b (manufactured by Novagen), previously linearized 

30 with Nco\ and BamHl. This plasmid was named pFSNBPET. Of the DNA insert in the plasmid, the region amplified by 
PCR was analyzed by the dideoxy method to determine its nucleotide sequence. There was confirmed that there is no 
mutation caused by PCR. 

[0152] Escherichia coii HMS174(DE3)/pF5NBPET, Escherichia coli HMS1 74(DE3) transformed with the plasmid 
pFSNBPET, was evaluated for F5 expression, and there was demonstrated that a protein of a molecular weight conre- 
35 sponding to F5 in the culture of the transformant is expressed. 

Example 13 

(1) Subcloning of F3 Gene 

40 

[0153J On the basis of the N-terminal amino acid sequence of F3 obtained in Example 3, the primers F3-1 and F3- 
3-1, of which nucleotide sequences are shown in SEQ ID NOs: 73 and 74 in Sequence Listing, were synthesized. First 
PCR was carried out using 100 pmol of the primer F3-1 and 20 pmol of the cassette primer C1 with 1 pi of the 
fl&/ll/Sat/3AI cassette DNA of Example 4 as a template. With 1 uJ of the above reaction mixture as a template, second 

45 PCR was carried out using F3-3-1 and the cassette primer C2. The PCR was carried out in a reaction mixture of the 
same composition as that used in Example 5(1 ) using Pfu DNA polymerase as an enzyme in 30 cydes for the first PCR 
and 25 cycles for the second, wherein one cycle comprises a process consisting of at 94°C (30 seconds) - 45°C (30 
seconds) - 72°C (2 minutes). An amplified DNA fragment of about 500 bp by this reaction was subcloned into plasmid 
vector pTV1 18N, and a part of its nucleotide sequence was determined by the dideoxy method using M4 and RV prim- 

so ers (manufactured by Takara Shuzo Co., Ltd.). On the basis of the sequence determined, the primers F3S1, F3S2. 
F3S3 and F3S4, of which nucleotide sequences are shown in SEQ ID NOs: 75, 76. 77 and 78 in Sequence Listing, were 
then synthesized. PCR was carried out using these F3S1 and F3S2 primers with the cosmid DNA prepared in Example 
1 as a template, and cosmid clones carrying the F3 gene were searched. As a result, there was found no cosmid clone 
assumed to carry the F3 gene. With this in mind, PCR was carried out using the primer F3S3 or F3S4 and the primer 

55 C2 with each cassette DNA of Example 4 as a template. As a result of mapping of the restriction endonuclease recog- 
nition sites in the neighborhood of the F3 gene, there was anticipated that the F3 gene is present in a fragment of about 
2.6 kb between the Sa/I site and the HindlU site. On the basis of the results, 4 ug of Pyrococcus furiosus genomic DNA 
was digested with Sa/I and Hind\\\, and thereafter a DNA fragment of about 2.6 kb was collected and subcloned into 
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pTV1 1 8N vector. PCR was carried out using the primer F3S4 and the primer RV-N (manufactured by Takara Shuzo Co., 
Ltd.) with each of the recombinant plasmids thus obtained as a template, to examine for the presence of the F3 gene. 
As a result, a plasmid harboring a 2.6 kb Sai\'Hind\\\ fragment carrying the F3 gene was obtained, and this plasmid was 
named the plasmid pF3SH92. Escherichia coli JM109/pF3SH92, Escherichia coli JM109 transformed with this plas- 
5 mid, was examined for F3 expression, and as a result, there was confirmed that a protein having a molecular weight 
corresponding to F3 is expressed. 

(2) Determination of Nucleotide Sequence of DNA Fragment Carrying F3 Gene 

io [0154] The nucleotide sequence of the DNA insert in the above plasmid pF3SH92 was determined by the dideoxy 
method. As a result of analyzing the nucleotide sequence determined, there was found an open reading frame encod- 
ing a protein of which N-terminal amino acid sequence is identical to that of F3. The nucleotide sequence of this open 
reading frame is shown in SEQ ID NO: 79 in Sequence Listing, and the amino acid sequence of F3 as deduced from 
the nucleotide sequence is shown SEQ ID NO: 80, respectively, in Sequence Listing. This amino acid sequence was 

is searched for homology to the amino acid sequences of known proteins, and as a result, the amino acid sequence is 
found to be homologous to Mycoplana ramosa-derlved acetyl polyamine aminohydrase [Journal of Bacteriology, 178. 
5781-5786 (1996)] and human histone deacetylase [Science, 272, 408-41 1 (1996)]. 

Example 14 

20 

[0155] In the following Example, the activities of commercially available enzymes are shown on the basis of the 
labeling for individual enzymes. Also, reaction mixtures containing commercially available enzymes were prepared in 
accordance with the manuals for the respective enzymes, or using the reaction buffers attached thereto, unless other- 
wise specified. PCR was carried out using GeneAmp PCR System 9600 (manufactured by Perkin-Elmer). 

25 

(1) Preparation of Anti-PFU-RFC Antibody 

[0156] The PFU-RFC authentic sample of Example 9(6) was diluted so as to have a concentration of 1 mg/100 uJ 
with 50 mM Tris-HCI, pH 8.0, 2 mM 2-mercaptoethanol and 75 mM NaCI, and the mixture was emulsified with an equal 

so volume of Freund's complete adjuvant. This emulsion was subcutaneously injected to rabbits at 50 ul per injection four 
times at 3-week intervals. Ten days after final immunization, whole blood was extracted. After allowing to stand at room 
temperature for 60 minutes, the extracted blood was centrifuged to yield 50 ml of an antiserum containing the anti-PFU- 
RFC polyclonal antibody. To 20 ml of this antiserum, 20 ml of a saturated solution of ammonium sulfate was added, and 
the mixture was gently stirred at 4°C for 45 minutes and subsequently centrifuged. TTie precipitate obtained was sus- 

35 pended in 5 ml of 20 mM sodium phosphate buffer, pH 7.0, and thrice subjected to 2-hour dialysis against 2 liters of the 
same buffer as a dialysate. After dialysis, 1 4 ml of the solution was applied to Protein A column (manufactured by Phar- 
macia), previously equilibrated with 20 mM sodium phosphate buffer (pH 7.0). After the column was washed with the 
same buffer, the elution was carried out with 0.1 M sodium citrate buffer (pH 3.0). After the anti-PFU-RFC antibody 
eluted was neutralized with 1 M Tris-HCI, pH 9.0, the mixture was then concentrated using Centriflow CF-50 and sub- 

40 jected to exchange with coupling buffer (0.5 M NaCI, 0.2 M NaHC0 3> pH 8.3) using PD-10 column to prepare a solution 
containing the anti-PFU-RFC antibody. 

(2) Preparation of Anti-PFU-RFC Antibody Column 

45 [0157] HiTrap NHS-activated column (manufactured by Pharmacia) was washed with 6 ml of 1 mM HCI, and there- 
after 0.95 ml of the above anti-PFU-RFC polyclonal antibody solution (containing 3.8 mg equivalent of the anti-PFU- 
RFC antibody) was applied thereto. Subsequently, an anti-PFU-RFC antibody column was prepared in the same man- 
ner as Example 2(3). 

so (3) Purification of Complex Containing PFU-RFC Using Anti-PFU-RFC Antibody Column 

[0158] Pyrococcus furiosus DSM3638 was cultured in the same manner as the method described in Example 1 to 
yield cells in 10 liters of culture medium. These cells were suspended in 33 ml of buffer C (50 mM Tris-HCI, pH 8.0, 0.1 
mM ATP) containing 2 mM PMSF, and the suspension was treated with an ultrasonic disrupter. The disrupted solution 
55 was centrifuged at 1 2,000 rpm for 10 minutes, and 38 ml of the supernatant obtained was applied to the anti-PFU-RFC 
antibody column, previously equilibrated with buffer C containing 0.1 M NaCI. After washing with buffer C containing 0.1 
M NaCI, the column was heated at 85°C for 1 hour, and the PFU-RFC complex was eluted with buffer C containing 0.1 
M NaCI. This eluate was subjected to SDS-PAGE (12.5% polyacrylamide gel; 25 mM Tris-HCI, 192 mM glycine, 0.1% 
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SDS, pH 8.4 used as electrophoresis buffer). The gel after electrophoresis was stained with Coomassie brilliant blue R- 
250 by a conventional method, and as a result in addition to the band of PFU-RFC, one band at a position for 33 kilo- 
dattons, which corresponds to the above F7, and two bands near 60 kilodaltons were detected. 
[0159] With this in mind, the N-terminal amino acid sequences of the proteins existing in these three bands were 
5 analyzed in the same manner as Example 3(2). As a result, as shown in Figure 14, the N-terminal amino acid sequence 
of the protein at a position corresponding to the above F7 was found to be identical to that of F7, and each of the N- 
terminal amino acid sequences of the two kinds of proteins near 60 kilodaltons was found to be identical to the above 
N-terminal amino acid sequence of the PFU-RFCLS. 

[0160] Next, the amounts of the PFU-RFC, PFU-RFCLS and F7 proteins in this eluate were quantified by the 
10 amount of Coomassie brilliant blue bound thereto. The eluate was subjected to SDS-PAGE (12.5% polyacrylamide gel; 
25 mM Tris-HCI, 1 92 mM glycine. 0.1% SDS, pH 8.4 used as electrophoresis buffer). The gel after electrophoresis was 
stained with Coomassie brilliant blue R-250 by a conventional method, and thereafter the band was cut out and treated 
with 500 nl of 70% formic acid to extract the Coomassie brilliant blue, and the absorbance at 630 nm was determined. 
On the basis of a calibration curve prepared using the F7 authentic sample of Example 8(4) and the PFU-RFC authentic 
is sample of Example 9(6), each of a known concentration, it was found that 208 *ig of PFU-RFC, 55 jig of PFU-RFCLS 
and 51 ng of the F7 protein were contained in 500 pJ of the eluate. The complex constituted by the three proteins PFU- 
RFC, PFU-RFCLS and F7 as described above is hereinafter referred to as RFC-N complex. 

(4) Effects of RFC-N Complex on Primer Extension Reactions 

20 

[0161] In order to examine the effects of the RFC-N complex obtained in Example 14(3) on the primer extension 
reactions of various polymerases, the activities of Pf u polymerase C and Pfu DNA polymerase (a-type DNA polymer- 
ase, manufactured by STRATAGENE) were compared between cases where the RFC-N complex was added and cases 
where only its constituent F7 was added. The DNA polymerase activities were determined in the same manner as the 

25 method described in Example 8(5). except that 50 fmol of Pfu polymerase C or Pfu DNA polymerase was used. For the 
determination of the DNA polymerase activities, one prepared by annealing the HT primer, which is a synthetic oligo- 
nucleotide of 45 bases, to M13 phage single-stranded DNA (M13mp18ss DNA, manufactured by Takara Shuzo Co., 
Ltd.), was used as shown in Example 8(5) (M1 3-HT primer). The nucleotide sequence of the HT primer is shown in SEQ 
ID NO: 42 in Sequence Listing. The results for Pfu DNA polymerase are shown in Figure 15. The amounts of F7 and 

30 the RFC-N complex added are expressed in the molar numbers of F7 and RFC-N complex contained in the reaction 
mixture. As shown in Figure 1 5. the RFC-N complex showed higher increase in the activity to Pfu DNA polymerase than 
that of F7 alone. 

[0162] Furthermore, the primer extension activity was studied by the method described in Example 8(5). Reaction 
mixtures for determination were prepared with the following compositions: 1) 100 fmd of F7, 2) 0.05 ni of the RFC-N 

35 complex (containing 60 fmol of F7), 3) 10 fmol of Pfu polymerase C, 4) 10 fmol of Pfu polymerase C + 100 fmol of F7, 
5) 100 fmol of Pfu polymerase C + 0.05 |il of the RFC-N complex, 6) 20 fmol of F7, 7) 0.02 ^1 of the RFC-N complex 
(containing 24 fmol of F7), 8) 10 fmol of Pfu DNA polymerase, 9) 10 fmol of Pfu DNA polymerase + 20 fmol of F7, 10) 
10 fmol of Pfu DNA polymerase + 0.02 uJ of the RFC-N complex. To 1 fit of each reaction mixture for determination, 9 
*il of a reaction mixture [20 mM Tris-HCI (pH 9.0), 15 mM Mgd 2 , 2 mM 2-mercaptoethanol, 40 jiM each of dATP, dGTP, 

40 dCTP and dTTP] containing 0.01 ng/*il 32 P-labeled M1 3-HT primer was added, and the reaction was carried out at 75°C 
for 2.5 minutes. After termination of the reaction, the reaction mixture was cooled with ice to stop the reaction, and 1 \j\ 
of 200 mM EDTA and 5 nl of a reaction stopper (95% formamide, 20 mM EDTA, 0.05% bromophenol blue, 0.05% xylene 
cyanol) were further added thereto, and the mixture was subjected to thermal denaturation treatment at 95°C for 5 min- 
utes. After 1.6 ^ of this reaction mixture was electrophoresed using 6% polyacrylamide gel containing 8 M urea, an 

45 autoradiogram was prepared. 

[0163] Next, in order to analyze primer extension reaction products of longer chains, the analysis was carried out 
by the method described in Example 8(5). To 1 jil of each of sample solutions 1) to 10) above. 9 *il of a reaction mixture 
[20 mM Tris-HCI, pH 9.0, 1 5 mM MgCfe, 2 mM 2-mercaptoethanol, 40 *iM each of dATP, dGTP, dCTP and dTTP, 84 nM 
[a- 32 P]-dCTP] containing M13-HT primer to have a final concentration of 0.01 ng/jil was added, and the mixture was 

so reacted at 75°C for 2.5 minutes. After termination of the reaction, to the ice cooled reaction mixture, 1 . 1 1 nl of 200 mM 
EDTA, 1 .23 til of 500 mM NaOH, and 2.47 uJ of 6-fold concentrated loading buffer (0.125% bromophenol blue, 0.125% 
xylene cyanol. 9% glycerol) were sequentially added. After 6 jil of this mixture was electrophoresed using 0.5% alkaline 
agarose gel, an autoradiogram was prepared. 

[0164] In either case of Pfu polymerase C and Pfu DNA polymerase, the amount of long-chain extension products 
55 increased in the case where the RFC-N complex was added as compared to the case of F7 alone. 

[0165] The chain lengths of the long-chain extension products were found to be up to about 7.2 kb, a full length of 
the template, in either of the polymerases used, in the case of F7 alone and of the RFC-N complex. 
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Example 15 Construction of Plasmid for rRFC-M Expression 
[0166] 

5 (1) A plasmid for simultaneously expressing PFU-RFCLS and PFU-RFC was constructed. In reference to the nucle- 
otide sequence determined in Example 1 1(2), the primer RFLS-NdeN, of which nucleotide sequence is shown in 
SEQ ID NO: 81 in Sequence Listing, and RFLS-S9, of which nucleotide sequence is shown in SEQ ID NO: 82, were 
synthesized. PCR was carried out using both of these primers with the above plasmid pRFLSNh as a template. The 
PCR was carried out in a reaction mixture of the same composition as that used in Example 5(1) using Pfu DNA 

10 polymerase as an enzyme, 10 ng of the plasmid pRFLSNh and 20 pmol each of the primers in 30 cycles, wherein 
one cycle comprises a process consisting of at 94°C (30 seconds) - 55°C (30 seconds) • 72°C (3 minutes). An 
Nde\-Pst\ fragment of about 920 bp isolated after digesting an amplified DNA fragment obtained by PCR with Nde\ 
and Psrl, a Psrl-f coRI fragment of about 600 bp isolated from the plasmid pRFLSNh obtained in Example 1 1 (2), 
and an EcoRI-BamHI fragment of about 2 kb isolated from the plasmid pRFS254SNc obtained in Example 9(4) 

is were mixed and subcloned between the Nde\ and BamH\ sites of plasmid vector pTV119Nd. The recombinant 
plasmid thus obtained was named pRFC10. In addition, Escherichia coii JM109 transformed with the plasmid was 
named Escherichia coli JM1094>RFC10. This transformant was found to possess a high level of expression of 
PFU-RFCLS and PFU-RFC. 

20 (2) Determination of Nucleotide Sequence of Genes Encoding PFU-RFCLS and PFU-RFC 

Of the DNA insert in the plasmid pRFC10 obtained in Example 15(1), the region amplified by PCR was ana- 
lyzed by the dideoxy method to determine its nucleotide sequence, and it was confirmed that there is no mutation 
caused by PCR. From this result and the results of Example 9(3) and Example 1 1(2), the nucleotide sequence of 
the gene encoding PFU-RFCLS and PFU-RFC without carrying intein was determined. The nucleotide sequence 

25 of the genes encoding PFU-RFCLS and PFU-RFC without carrying intein thus obtained is shown in SEQ ID NO: 
83 in Sequence Listing, and its restriction endonuclease map is shown in Figure 16. 

Example 16 Preparation of rRFC-M Authentic Sample 

30 [0167] Escherichia coli JM109/&RFC10 obtained in Example 15(1) was cultured for 16 hours in 500 ml x 4 of LB 
medium (10 g/l trypton, 5 g/l yeast extract, 5 g/l NaCI. pH 7.2), in which ampicillin was present at a concentration of 100 
ug/ml, and IPTG is present at 1 mM. After harvesting, cells were suspended in 35.9 ml of sonication buffer [50 mM Tris- 
HCI, pH 8.0, 2 mM 2-mercaptoethanol, 10% glycerol, 2 mM PMSF (phenyl methanesulfonyl fluoride)], and the suspen- 
sion was treated with an ultrasonic disrupter. After centrifugation at 12,000 rpm for 10 minutes, a heat treatment was 

35 carried out at 80°C for 1 5 minutes. Thereafter, centrifugation at 1 2,000 rpm for 1 0 minutes was again carried out to yield 
33.0 ml of a heat-treated enzyme solution. This solution was then applied to RESOUCE Q column (manufactured by 
Pharmacia), previously equilibrated with buffer A (50 mM Tris-HCI, pH 8.0, 2 mM 2-mercaptoethanol, 10% glycerol), 
and the applied solution was chromatographed using FPLC system (manufactured by Pharmacia). The elution was car- 
ried out on a linear concentration gradient of 0 to 500 mM NaCI. 

40 [01 68] As a result of analyzing the eluate by SDS-PAGE (1 2.5% pdyacrylamide gel; 25 mM Tris-HCI, 1 92 mM gly- 
cine, 0.1% SDS, pH 8.4 used as electrophoresis buffer), PFU-RFCLS and PFU-RFC were both eluted at an NaCI con- 
centration of 240 mM. When the eluate obtained from cells in which PFU-RFC was expressed alone as descrfoed in 
Example 9(6) was applied to RESOURCE Q column, the eluate was not adsorbed to RESOURCE Q column. On the 
other hand, when the eluate obtained from celts in which PFU-RFCLS and PFU-RFC were simultaneously expressed 

45 was applied to RESOURCE Q column, the eluate was adsorbed thereto, and PFU-RFCLS and PFU-RFC were simul- 
taneously eluted at an NaCI concentration of 240 mM, as described above. From the results, it was demonstrated that 
these two proteins have formed a complex. This complex is hereinafter referred to as rRFC-M complex. 
[0169] After 4.8 ml of an enzyme solution obtained by collecting the rRFC-M complex fraction was concentrated 
using Centriflow CF50, the concentrate was subjected to exchange with buffer A containing 150 mM NaCI using PD-1 0 

so column (manufactured by Pharmacia), and 3.5 ml of the solution was applied to Heparin column (manufactured by 
Pharmacia), previously equilibrated with buffer A containing 1 50 mM NaCI. Using FPLC system, the chromatogram was 
developed on a linear concentration gradient from 150 mM to 650 mM NaCI, and an rRFC-M complex fraction eluted at 
450 mM NaCI was obtained. Using Centricon-10 (manufactured by Amicon), 3.9 ml of this fraction was concentrated, 
and 1 15 jil of the concentrate was applied to Superdex 200 gel filtration column (manufactured by Pharmacia), previ- 

55 ously equilibrated with 50 mM Tris-HCI, pH 8.0, 2 mM 2-mercaptoethanol and 75 mM NaCI. The elution was carried out 
with the same buffer, and the rRFC-M complex was found to have a retention time of 26.3 minutes. Rom the compara- 
tive results with the position of the elution of a molecular weight marker under the same conditions, the molecular weight 
of the rRFC-M complex was calculated as about 370 kilodaftons. 
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[0170] Furthermore, in order to determine the compositional ratio of each unit in the rRFC-M complex, the above 
eluted fraction of a molecular weight of about 370 kDa was subjected to SDS-PAGE. 

[0171] The gel after electrophoresis was stained with Coomassie brilliant blue R-250 by a conventional method, 
and thereafter the bands of the PFU-RFCLS and PFU-RFC proteins were cut out and extracted with 500 nl of 70% for- 
s mic acid. The absorbance at 630 nm of each extract was determined, and the results were compared with the calibra- 
tion curve prepared by using PFU-RFC prepared in Example 9(6), and whereby the amount of each protein was 
determined and the molar number was calculated. 

[0172] As a result, PFU-RFCLS and PFU-RFC were found to exist in a 1 :4 ratio. Based on the fact that the molec- 
ular weight of the rRFC-M complex as calculated by the gel filtration described above was about 370 kDa, the rRFC-M 
io complex was assumed to be formed by two molecules of PFU-RFCLS and eight molecules of PFU-RFC. With this in 
mind, the molar number was calculated, taking the above rRFC-M complex as 1 unit. 

Example 17 Construction of Plasmid F3 Expression 

15 [0173] 

(1) PCR was carried out using the primer F3Nd, of which nucleotide sequence is shown in SEQ ID NO: 84 in 
Sequence Listing, and the F3S2 primer, of which nucleotide sequence is shown in SEQ ID NO: 76, with the plasmid 
pF3SH92 as prepared in Example 13 as a template. The PCR was carried out in a reaction mixture of the same 

20 composition as that used in Example 5(1 ) using Pfu DNA polymerase as an enzyme, 1 ng of the plasmid pF3SH92 
and 20 pmol each of the primers in 30 cycles, wherein one cycle comprises a process consisting of at 94°C (30 
seconds) - 55°C (30 seconds) - 72°C (1 minute). An Nde\-Pst\ fragment of about 0.5 kb isolated after digestion of 
an amplified DNA fragment obtained by PCR with A/del and Pst\, and a Psfl-EcoRI fragment of about 1.1 kb iso- 
lated from the plasmid pF3SH92 were mixed and subcloned between the Nde\ and Eco Rl sites of plasmid vector 

25 pTV119Nd. The recombinant plasmid thus obtained was named pF3-1 9. Inaddition, Escherichia coiiM^ 09 trans- 
formed with the plasmid was named Escherichia coii JM109/pF3-19. The transformant was found to possess high 
expression of F3. 

(2) Determination of Nucleotide Sequence of Gene Encoding F3 

30 Of the DNA insert in the plasmid pF3-19, obtained in Example 17(1), the region amplified by PCR was ana- 

lyzed by the dideoxy method to determine its nucleotide sequence, and confirmed that there is no mutation caused 
by PCR. 

Example 18 Preparation of Purified F3 Authentic Sample 

35 

[0174] Escherichia coli JM109/pF3-19 obtained in Example 17(1) was cultured for 16 hours in 500 ml x 4 of LB 
medium (10 g/liter trypton, 5 g/lrter yeast extract, 5 g/liter NaCI, pH 7.2) in which ampicillin was present at a concentra- 
tion of 100 jig/ml. After harvesting, cells were suspended in 50 ml of sonication buffer [50 mM Tris-HQ, pH 8.0, 2 mM 
2-mercaptoethanol, 1 0% glycerol, 2 mM PMSF (phenylmethanesulfonyl fluoride)], and the suspension was treated with 

40 an ultrasonic disrupter. After centrifugation at 1 2,000 rpm for 1 0 minutes, the supernatant was subjected to heat treat- 
ment at 80°C for 15 minutes. Thereafter, centrifugation at 12,000 rpm for 10 minutes was again carried out to yield a 
heat-treated supernatant. Forty-four milliliters of the heat-treated supernatant was applied to RESOURCE Q column 
(manufactured by Pharmacia), previously equilibrated with buffer A described in Example 16, and the applied solution 
was chromatographed using FPLC system (manufactured by Pharmacia). The chromatogram was developed on a lin- 

45 ear concentration gradient from 0 to 500 mM NaCI. To 1 1 ml of a solution of the fraction containing F3 eluted at 140 mM 
to 240 mM NaCI, 5.5 ml of buffer A containing 3 M ammonium sulfate was added, and this solution was applied to 
HiTrap butyl column (manufactured by Pharmacia), previously equilibrated with buffer A containing 1 M ammonium sul- 
fate. After the column was washed with buffer A containing 1 M ammonium sulfate using FPLC system, F3 was eluted 
with buffer A containing 0.5 M ammonium sulfate. Six milliliters of this fraction was applied to HiTrap phenyl column 

so (manufactured by Pharmacia), previously equilibrated with buffer A containing 0.5 M ammonium sulfate. After the col- 
umn was washed with buffer A containing 0.5 M ammonium sulfate using FPLC system, F3 was eluted with buffer A. 
Using Centricon-10 (manufactured by Amicon), 9.5 ml of this fraction was concentrated, and 155 jil of the concentrate 
was applied to Superdex 200 gel filtration column (manufactured by Pharmacia), previously equilibrated with 50 mM 
Tris-HCI, pH 8.0, 2 mM 2-mercaptoethanol and 75 mM NaCI. The elution was carried out with the same buffer, and as 

55 a result, F3 was eluted at a position corresponding to a retention time of 42.1 minutes. From the comparative results in 
the position of the elution of a molecular weight marker under the same conditions, a molecular weight of about 25 Wlo- 
dattons was anticipated. On the basis that the theoretical value of the molecular weight of F3 is 37 kilodaltons, F3 is 
deduced to be a monomer. 
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PYflmnlft 19 Preparation of Purified F5 Authentic Sample 

[0175] Escherichia coli HMS1 74(DE3)/pF5NBPET, Escherichia coii HMS1 74(DE3) transformed with the plasmid 
pFSNBPET obtained in Example 12(4), was cultured for 16 hours in 500 ml x 4 of LB medium (10 g/liter trypton, 5 g/liter 

5 yeast extract, 5 g/liter NaCI, pH 7.2) in which ampicillin was present at a concentration of 100 ug/ml. After harvesting, 
cells were suspended in 61 ml of sonication buffer, and the suspension was treated with using an ultrasonic disrupter. 
The disrupted cells were centrifuged at 12,000 rpm for 10 minutes, and thereafter the supernatant was subjected to 
heat treatment at 80°C for 15 minutes. Thereafter, centrifugation at 12,000 rpm for 10 minutes was again carried out to 
yield a heat-treated supernatant. To 60.5 ml ammonium sulfate, 8.71 g of ammonium sulfate was added, and the mix- 

10 ture was stirred at 4°C for 2 hours, and thereafter centrifugation at 1 2,000 rpm for 10 minutes was carried out. The pre- 
cipitate was dissolved in 19 ml of buffer A and dialyzed against buffer A. The enzyme solution after dialysis was applied 
to RESOURCE Q column (manufactured by Pharmacia), previously equilibrated with buffer A, and the applied solution 
was chromatographed using FPLC system (manufactured by Pharmacia). The chromatogram was developed on a lin- 
ear concentration gradient from 0 to 500 mM NaCI. Using Centricon-10 (manufactured by Amicon), 1 1 ml of a solution 

is of a fraction containing F5 eluted at 350 mM to 450 mM NaCI was concentrated, and 222 uJ of the concentrate was 
applied to Superdex 200 gel filtration column (manufactured by Pharmacia), previously equilibrated with 50 mM Tris- 
HQ, pH 8.0, 2 mM 2-mercaptoethanol and 75 mM NaCI. The elution was carried out with the same buffer, and as a 
result, F5 was eluted at a position corresponding to a retention time of 32.5 minutes. From the comparative results with 
the position of the elution of a molecular weight marker under the same conditions, a molecular weight of about 145 

20 kilodaltons was anticipated. This molecular weight corresponds to the case where F5 has formed a heptamer. 

Example 20 Preparation of Primers 

[0176] On the basis of the nucleotide sequence of XDNA, eight kinds of primers, i.e., VI B to X5 and X7 to X9, were 
25 synthesized. The nucleotide sequences of the primers A.1 B to XS and X7 to X9 are shown in SEQ ID NOs: 85 to 92, 
respectively, in Sequence Listing. The chain lengths of DNA fragments amplified by PCR using combinations of these 
primers with IDHA as a template are shown in Table 4. 



Table 4 



Primer Pairs 


Chain Length of DNA 
Fragment Amplified 


MB/X2 


0.5 kb 


X1B/X3 


1 kb 


MB/X4 


2kb 


MB/X5 


4kb 


X1B/A7 


8kb 


X1B/X8 


10 kb 


MB/X9 


12 kb 



45 Example 21 Eff ects of F1 Protein on DNA Polymerase 

[0177] The effects of the F1 protein obtained in Example 5 on PCR were examined. In order to carry out an ampli- 
fication reaction of 1 to 4 kb DNA fragments using A.DNA as a template, each of the primers MB and X3, the primers 
A,1 B and X4, and the primers X1 B and X5, were used as primer pairs to prepare reaction mixtures of the compositions 

so shown below: 10 mM Tris-HCI, pH 9.2, 75 mM KCI, 6 mM MgCfe, 0.4 mM each of dATP, dCTP, dGTP and dTTP, 0.01% 
BSA and 1 .25 units of Pfu polymerase C, 500 pg of template DNA, 5 pmol each of the primers. 1 73 pmol of the F1 pro- 
tein (final volume being 25 yd). Using each reaction mixture, the reaction was carried out in 30 cycles, wherein one cycle 
comprises a process consisting of at 98°C, 0 second - 68°C, 0 second. The phrases "98°C, 0 second", "68°C, 0 second" 
etc. as used in the present specification indicate that the reaction apparatus was programmed so that the setting tem- 

55 perature is immediately shifted to the next one when the setting temperature is reached. 

[0178] After termination of the reaction, 5 \i\ of the reaction mixture was electrophoresed on 1% agarose gel (man- 
ufactured by Takara Shuzo Co., Ltd.) to confirm amplified fragments. 

[0179] As a result, the amplification of 1 kb, 2 kb and 4 kb DNA fragments, depending on the primer pairs used, was 
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conf irmed. On the other hand, when the above reaction mixture without the addition of the F1 protein was subjected to 
PCR under the above reaction conditions, no amplified fragments could be confirmed. 

F*am plft 22 Effects of F1 . F3 and F5 Proteins on DNA Polymerase 

5 

[0180] The effects of the F1 protein obtained in Example 5, the F3 protein obtained in Example 18 and the F5 pro- 
tein obtained in Example 19 were used to investigate the amplification of a 6 kb DNA fragment by PCR with XDNA as 
a template. Reaction mixtures of the same compositions as those used in Example 21 were prepared, except that the 
primers X1 and XS were used as a primer pair. The F1 protein was added in an amount of 173 pmol, the F3 protein was 
10 added in an amount of 1 0 pmol, and the F5 protein was added in an amount of 1 pmol, respectively, to make up a final 
volume of 25 pi. Using each reaction mixture, the reaction was carried out in 30 cycles, wherein one cycle comprises a 
process consisting of at 98°C. 1 second - 68°C, 2 minutes. After termination of the reaction, 5 pi of the reaction mixture 
was electrophoresed on 1% agarose gel to confirm amplified fragments. 

[0181 ] As a result, the amplification of a 6 kb DNA fragment was confirmed in the presence of any of the F1 , F3 and 
is F5 proteins. On the other hand, when these proteins were not added, no amplified fragments could not be confirmed. 

Example 23 Effects of F2 and F4 Proteins on DNA Polymerase 

[0182] The effects of the F2 protein obtained in Example 6 and the F4 protein obtained in Example 7 were used to 
20 investigate the amplification reaction of a 4 kb DNA fragment by PCR with A.DNA as a template. Reaction mixtures of 
the same compositions as those used in Example 21 were prepared, except that the primers X1 B and AS, as a primer 
pair, 0.75 units of Pfu polymerase C and 1 ng of template XDNA were used. The F2 protein and the F4 protein were 
each added in an amount of 1 .095 pmol to the reaction mixture to make up a final volume of 25 -pi. Using each reaction 
mixture, the reaction was carried out in 25 cycles, wherein one cycle comprises a process consisting of at 94°C, 30 sec- 
25 onds - 55°C, 30 seconds - 72°C, 2 minutes. After termination of the reaction, 5 pi of the reaction mixture was electro- 
phoresed on 1 % agarose gel to confirm amplified fragments. 

[0183] As a result, the amplification of a 4 kb fragment was confirmed in the presence of any of the F2 and F4 pro- 
teins. On the other hand, when these proteins were not added, no amplified fragment was confirmed. 

so Example 24 Eff ects of rRFC-M Complex on DNA Polymerases 

[0184] In order to examine the effects of the rRFC-M complex on the primer extension reactions of various polymer- 
ases, the activities of Pfu polymerase C and Pfu DNA polymerase (a-type DNA polymerase, manufactured by STRAT- 
AGENE) were compared for cases where the rRFC-M complex and F7 are coexistent, and for cases where F7 exists 
35 alone. 

[0185] DNA polymerase activities were determined in the same manner as the method described in Example 8{5), 
except that 50 fmol of Pfu polymerase C or Pfu DNA polymerase was used, and that 400 fmol of the rRFC-M complex 
and 0 to 200 fmol of F7 were added. The results of the case of using Pfu DNA polymerase are shown in Figure 1 7. The 
effects on Pfu DNA polymerase were such that the activity was more elevated in the case of coexistence of the rRFC- 
40 M complex and F7 than the case of F7 alone. In addition, the effects on Pfu polymerase C showed the same tendency 
as those of Pfu DNA polymerase. 

Example 25 Effects of Coexistence of rRF C-M Complex and F7 Protein on PCR 

45 [0186] In order to carry out an amplification reaction of a 4 kb DNA fragment using A,DNA as a template, reaction 
mixtures of the same compositions as those used in Example 21 were prepared, except that the primers A.1 B and X5 
and 0.375 units of Pfu polymerase C were used. The rRFC-M complex was added in an amount of 312.5 fmol, and the 
F7 protein was added in an amount of 1 25 fmol, respectively, to the reaction mixture to make up a final volume of 25 pi. 
Using each reaction mixture, the reaction was carried out in 30 cycles, wherein one cycle comprises a process consist- 
so ing of at 98°C, 0 second - 68°C, 10 seconds. After termination of the reaction, 5 pi of the reaction mixture was electro- 
phoresed on 1% agarose gel (manufactured by Takara Shuzo Co., Ltd.) to confirm amplified fragments. 
[0187] As a result, the amplification of a 4 kb DNA fragment, depending on the primer pair used, was confirmed in 
the case of the system where the rRFC-M complex and the F7 protein were coexistent On the other hand, when these 
proteins were not added, no amplified fragments could be confirmed. 
55 [0188] Furthermore, a similar experiment was carried out for an amplification reaction of 8 to 1 2 kb DNA fragments 
using JiDNA as a template. Reaction mixtures of the same compositions as those used in Example 21 were prepared, 
except that each of the primers X.1 B and X7, the primers M B and X8, and the primers A,1 B and X9 were used as primer 
pairs, and further 0.375 units of Pfu polymerase C, and 2.5 ng of template XDNA were used. The rRFC-M complex was 
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added in an amount of 312.5 fmol, and the F7 protein was added in an amount of 125 fmol, respectively, to the reaction 
mixture to make up a final volume of 25 til. Using each reaction mixture, the reaction was carried out in 30 cycles, 
wherein one cycle comprises a process consisting of at 98°C, 0 second - 68°C, 3 minutes. After termination of the reac- 
tion, 5 jjiI of the reaction mixture was eiectrophoresed on 1% agarose gel (manufactured by Takara Shuzo Co., Ltd.) to 
5 confirm amplified fragments. 

[0189] As a result, the amplification of 8 kb, 1 0 kb and 1 2 kb DNA fragments, depending on the primer pairs used, 
was confirmed in the case of the system where the rRFC-M complex and the F7 protein were coexistent. On the other 
hand, when these proteins were not added, only a 8 kb DNA fragment was confirmed. 

io Example 26 Effects of Coexiste nce of rRFC-M Complex and F7 Protein on Pfu DNA Polymerase 

[0190] In order to carry out an amplification reaction of a 4 kb DNA fragment using A.DNA as a template, using each 
of the primers X1 B and A3, the primers X1 B and A.4, and the primers A.1 B and X5, as primer pairs, reaction mixtures of 
the compositions shown below were prepared: buffer supplied with Pfu DNA polymerase, 0.2 mM each of dATP, dCTP, 

is dGTP and dTTP, and 0.5 units each of Pfu polymerase, 500 pg of template DNA, 2.5 pmol of each primer, 2.5 pmol of 
the rRFC-M complex protein, and 0.5 pmol of the F7 protein (final volume being 25 Using each reaction mixture, the 
reaction was carried out in 25 cycles, wherein one cycle comprises a process consisting of at 94°C, 30 seconds - 55°C, 
30 seconds - 72°C, 1 minute. After termination of the reaction, 5 yl of the reaction mixture was eiectrophoresed on 1% 
agarose gel to confirm amplified fragments. 

20 [01 91 ] As a result, the ampl rf ication of 1 kb, 2 kb and 4 kb DNA fragments, depending on the primer pairs used, was 
confirmed in the case of the system where the rRFC-M complex and the F7 protein were coexistent. On the other hand, 
when these proteins were not added, only 1 kb to 2 kb DNA fragments were confirmed. 

Example 27 Effects of Coexistence of rR FC-M Comdex and F7 Protein on Mixed DNA Polymerase 

25 

[0192] The effects of the coexistence of the rRFC-M complex and the F7 protein on PCR using a mixture of two 
kinds of DNA polymerases were examined. 

[0193] In order to carry out an amplification reaction of a 1 kb DNA fragment using XDNA as a template, using the 
primers M B and 313 as a primer pair, reaction mixtures of the compositions shown below were prepared: buffer supplied 

30 with TaKaRa LA Taq (Mg Plus), 0.4 mM each of dATP, dCTP, dGTP and dTTP, 1 .25 units of LA Taq DNA polymerase 
(manufactured by Takara Shuzo Co., Ltd.), 500 pg of template DNA, 5 pmol of each primer, 62.5 fmol of the RFC com- 
plex protein, and 12.5 fmol of the F7 protein (final volume being 25 nl). Using each reaction mixture, the reaction was 
carried out in 30 cycles, wherein one cycle comprises a process consisting of at 98°C, 0 second - 68°C, 10 seconds. 
After termination of the reaction, 5 nl of the reaction mixture was eiectrophoresed on 1% agarose gel to confirm ampli- 

35 tied fragments. 

[0194] As a result, there can be confirmed that a DNA fragment of 1 kb was most efficiently amplified, in the case 
of the system where the rRFC-M complex and the F7 protein were added, as a result of comparison of the system 
where the rRFC-M complex and the F7 protein were added with the system where the rRFC-M complex alone was 
added, the system where the F7 protein alone was added, or the system where LA Taq DNA polymerase alone was 
40 added. 

INDUSTRIAL APPLICABILITY 

[0195] According to the present invention, there can be provided a DNA polymerase-associated factor capable of 
45 enhancing DNA synthesizing-activity of a DNA polymerase. The factor has an action on various DNA polymerases, and 
also can be utilized in various processes in which a DNA polymerase is used, so that the factor is useful as a reagent 
for studies in genetic engineering. Further, it is now possible to produce the enzyme by genetic engineering techniques 
using a gene encoding the DNA polymerase-associated factor of the present invention. 
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SEQUENCE LISTING 

SEQ ID NO: 1 
SEQUENCE LENGTH: 249 
SEQUENCE TYPE: amino acid 
STRANDEDNESS: single 
TOPOLOGY: linear 
MOLECULAR TYPE: peptide 
SEQUENCE DESCRIPTION: 

Met Pro Phe Glu He Val Phe Glu Gly Ala Lys Glu Phe Ala Gin 

5 10 15 

Leu He Asp Thr Ala Ser Lys Leu He Asp Glu Ala Ala Phe Lys 
20 25 30 

Val Thr Glu Asp Gly He Ser Met Arg Ala Met Asp Pro Ser Arg 
35 40 45 

Val Val Leu He Asp Leu Asn Leu Pro Ser Ser He Phe Ser Lys 
50 55 60 

Tyr Glu Val Val Glu Pro Glu Thr He Gly Val Asn Met Asp His 
65 70 75 

Leu Lys Lys He Leu Lys Arg Gly Lys Ala Lys Asp Thr Leu He 
80 85 90 

Leu Lys Lys Gly Glu Glu Asn Phe Leu Glu He Thr He Gin Gly 
95 100 105 

Thr Ala Thr Arg Thr Phe Arg Val Pro Leu He Asp Val Glu Glu 
110 115 120 

Met Glu Val Asp Leu Pro Glu Leu Pro Phe Thr Ala Lys Val Val 
125 130 135 

Val Leu Gly Glu Val Leu Lys Asp Ala Val Lys Asp Ala Ser Leu 
140 145 150 

Val Ser Asp Ser He Lys Phe He Ala Arg Glu Asn Glu Phe He 
155 160 165 

Met Lys Ala Glu Gly Glu Thr Gin Glu Val Glu He Lys Leu Thr 
170 175 180 

Leu Glu Asp Glu Gly Leu Leu Asp He Glu Val Gin Glu Glu Thr 
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30 



35 



185 190 195 

Lys Ser Ala Tyr Gly Val Ser Tyr Leu Ser Asp Met Val Lys Gly 

200 205 210 

Leu Gly Lys Ala Asp Glu Val Thr lie Lys Phe Gly Asn Glu Met 

215 220 225 

Pro Met Gin Met Glu Tyr Tyr He Arg Asp Glu Gly Arg Leu Thr 

230 235 240 

Phe Leu Leu Ala Pro Arg Val Glu Glu 

245 



SEQ ID NO: 2 
SEQUENCE LENGTH: 750 
SEQUENCE TYPE: nucleic acid 
20 STRANDED NES S : double 

TOPOLOGY: linear 
MOLECULAR TYPE: Genomic DNA 
SEQUENCE DESCRIPTION: 
25 ATGCCATTTG AAATCGTATT TGAAGGTGCA AAAGAGTTTG CCCAACTTAT AGACACCGCA 60 

AGTAAGTTAA TAGATGAGGC CGCGTTTAAA GTTACAGAAG ATGGGATAAG CATGAGGGCC 120 
ATGGATCCAA GTAGAGTTGT CCTGATTGAC CTAAATCTCC CGTCAAGCAT ATTTAGCAAA 160 
TATGAAGTTG TTGAACCAGA AACAATTGGA GTTAACATGG ACCACCTAAA GAAGATCCTA 240 
AAGAGAGGTA AAGCAAAGGA CACCTTAATA CTCAAGAAAG GAGAGGAAAA CTTCTTAGAG 300 
ATAACAATTC AAGGAACTGC AACAAGAACA TTTAGAGTTC CCCTAATAGA TGTAGAAGAG 360 
ATGGAAGTTG ACCTCCCAGA ACTTCCATTC ACTGCAAAGG TTGTAGTTCT TGGAGAAGTC 420 
CTAAAAGATG CTGTTAAAGA TGCCTCTCTA GTGAGTGACA GCATAAAATT TATTGCCAGG 480 
GAAAATGAAT TTATAATGAA GGCAGAGGGA GAAACCCAGG AAGTTGAGAT AAAGCTAACT 540 
CTTGAAGATG AGGGATTATT GGACATCGAG GTTCAAGAGG AGACAAAGAG CGCATATGGA 600 
GTCAGCTATC TCTCCGACAT GGTTAAAGGA CTTGGAAAGG COGATGAAGT TACAATAAAG 660 
40 TTTGGAAATG AAATGCCCAT GCAAATGGAG TATTACATTA GAGATGAAGG AAGACTTACA 720 

TTCCTACTGG CTCCAAGAGT TGAAGAGTGA 750 

SEQ ID NO: 3 
45 SEQUENCE LENGTH: 327 

SEQUENCE TYPE: amino acid 
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STRANDEDNESS: single 
TOPOLOGY: linear 
MOLECULAR TYPE: peptide 
SEQUENCE DESCRIPTION: 

Met Ser Glu Glu lie Arg Glu Val Lys Val Leu Glu Lye Pro Trp 

5 10 15 

Val Glu Lys Tyr Arg Pro Gin Arg Leu Asp Asp lie Val Gly Gin 
20 25 30 

Glu His lie Val Lys Arg Leu Lys His Tyr Val Lys Thr Gly Ser 
35 40 45 

Met Pro His Leu Leu Phe Ala Gly Pro Pro Gly Val Gly Lys Thr 
50 55 60 

Thr Ala Ala Leu Ala Leu Ala Arg Glu Leu Phe Gly Glu Asn Trp 
65 70 75 

Arg His Asn Phe Leu Glu Leu Asn Ala Ser Asp Glu Arg Gly lie 
80 85 90 

Asn Val lie Arg Glu Lys Val Lys Glu Phe Ala Arg Thr Lys Pro 
95 100 105 

lie Gly Gly Ala Ser Phe Lys He He Phe Leu Asp Glu Ala Asp 
110 115 120 

Ala Leu Thr Gin Asp Ala Gin Gin Ala Leu Arg Arg Thr Met Glu 
125 130 135 

Met Phe Ser Ser Asn Val Arg Phe He Leu Ser Cys Asn Tyr Ser 
140 145 150 

Ser Lys He He Glu Pro He Gin Ser Arg Cys Ala He Phe Arg 
155 160 165 

Phe Arg Pro Leu Arg Asp Glu Asp He Ala Lys Arg Leu Arg Tyr 
170 175 180 

He Ala Glu Asn Glu Gly Leu Glu Leu Thr Glu Glu Gly Leu Gin 
185 190 195 

Ala He Leu Tyr He Ala Glu Gly Asp Met Arg Arg Ala He Asn 
200 205 210 

He Leu Gin Ala Ala Ala Ala Leu Asp Lys Lys He Thr Asp Glu 
215 220 225 
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Asn Val Phe Met Val Ala Ser Arg Ala Arg Pro Glu Asp lie Arg 

230 235 240 

Glu Met Met Leu Leu Ala Leu Lys Gly Asn Phe Leu Lys Ala Arg 

245 250 255 

Glu Lys Leu Arg Glu lie Leu Leu Lys Gin Gly Leu Ser Gly Glu 

260 265 270 

Asp Val Leu Val Gin Met His Lys Glu Val Phe Asn Leu Pro lie 

275 280 285 

Glu Glu Pro Lys Lys Val Leu Leu Ala Asp Lys lie Gly Glu Tyr 

290 295 300 

Asn Phe Arg Leu Val Glu Gly Ala Asn Glu He lie Gin Leu Glu 

305 310 315 

Ala Leu Leu Ala Gin Phe Thr Leu He Gly Lys Lys 

320 325 



25 



SEQ ID NO: 4 
SEQUENCE LENGTH: 984 
SEQUENCE TYPE: nucleic acid 



STRANDEDNESS: double 

TOPOLOGY: linear 

MOLECULAR TYPE: Genomic DNA 

30 SEQUENCE DESCRIPTION: 

ATGAGCGAAG AGATTAGAGA AGTTAAGGTT CTAGAAAAAC CCTGGGTTGA GAAGTATAGA 60 

CCTCAAAGAC TTGACGACAT TGTAGGACAA GAGCACATAG TGAAAAGGCT CAAGCACTAC 120 

GTCAAAACTG GATCAATGCC CCACCTACTC TTCGCAGGCC CCCCTGGTGT CGGAAAGACT 180 

35 ACAGCGGCTT TGGCCCTTGC AAGAGAGCTT TTCGGCGAAA ACTGGAGGCA TAACTTCCTC 240 

GAGTTGAATG CTTCAGATGA AAGAGGTATA AACGTAATTA GAGAGAAAGT TAAGGAGTTT 300 

GCGAGAACAA AGCCTATAGG AGGAGCAAGC TTCAAGATAA TTTTCCTTGA TGAGGCCGAC 360 

^ GCTTTAACTC AAGATGCCCA ACAAGCCTTA AGAAGAACCA TGGAAATGTT CTCGAGTAAC 420 

GTTCGCTTTA TCTTGAGCTG TAACTACTCC TCCAAGATAA TTGAACCCAT ACAGTCTAGA 480 

TGTGCAATAT TCCGCTTCAG ACCTCTCCGC GATGAGGATA TAGCGAAGAG ACTAAGGTAC 540 

ATTGCCGAAA ATGAGGGCTT AGAGCTAACT GAAGAAGGTC TCCAAGCAAT ACTTTACATA 600 

45 GCAGAAGGAG ATATGAGAAG AGCAATAAAC ATTCTGCAAG CTGCAGCAGC TCTAGACAAG 660 

AAGATCACCG ACGAAAACGT ATTCATGGTA GCGAGTAGAG CTAGACCTGA AGATATAAGA 720 
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GAGATGATGC TTCTTGCTCT CAAAGGCAAC TTCTTGAAGG CCAGAGAAAA GCTTAGGGAG 760 
ATACTTCTCA AGCAAGGACT TAGTGGAGAA GATGTACTAG TTCAGATGCA CAAAGAAGTC 840 
TTCAACCTGC CAATAGAGGA GCCAAAGAAG GTTCTGCTTG CTGATAAGAT AGGAGAGTAT 900 
AACTTCAGAC TCGTTGAAGG GGCTAATGAA ATAATTCAGC TTGAAGCACT CTTAGCACAG 960 
TTCACCCTAA TTGGGAAGAA GTGA 984 



SEQ ID NO: 5 
SEQUENCE LENGTH: 613 
SEQUENCE TYPE: amino acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
MOLECULAR TYPE: peptide 
20 SEQUENCE DESCRIPTION: 

Met Asp Glu Phe Val Lys Ser Leu Leu Lys Ala Asn Tyr Leu lie 
5 10 15 

Thr Pro Ser Ala Tyr Tyr Leu Leu Arg Glu Tyr Tyr Glu Lys Gly 
25 20 25 30 

Glu Phe Ser lie Val Glu Leu Val Lys Phe Ala Arg Ser Arg Glu 
35 40 45 

Ser Tyr He He Thr Asp Ala Leu Ala Thr Glu Phe Leu Lys Val 
50 55 60 

Lys Gly Leu Glu Pro He Leu Pro Val Glu Thr Lys Gly Gly Phe 
65 70 75 

3S Val Ser Thr Gly Glu Ser Gin Lys Glu Gin Ser Tyr Glu Glu Ser 

80 85 90 

Phe Gly Thr Lys Glu Glu He Ser Gin Glu He Lys Glu Gly Glu 
95 100 105 

46 Ser Phe He Ser Thr Gly Ser Glu Pro Leu Glu Glu Glu Leu Asn 

110 115 120 

Ser He Gly He Glu Glu He Gly Ala Asn Glu Glu Leu Val Ser 
125 130 135 

Asn Gly Asn Asp Asn Gly Gly Glu Ala He Val Phe Asp Lys Tyr 
140 145 150 
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Gly Tyr Pro Met Val Tyr Ala Pro Glu Glu He Glu Val Glu Glu 

155 160 165 

Lys Glu Tyr Ser Lys Tyr Glu Asp Leu Thr He Pro Met Asn Pro 

170 175 180 

Asp Phe Asn Tyr Val Glu He Lys Glu Asp Tyr Asp Val val Pha 

185 190 195 

Asp Val Arg Asn Val Lys Leu Lys Pro Pro Lys Val Lys Asn Gly 

200 205 210 

Asn Gly Lys Glu Gly Glu He He Val Glu Ala Tyr Ala Ser Leu 

215 220 225 

Phe Arg Ser Arg Leu Lys Lys Leu Arg Lys He Leu Arg Glu Asn 

230 235 240 

Pro Glu Leu Asp Asn Val Val Asp He Gly Lys Leu Lys Tyr Val 

245 250 255 

Lys Glu Asp Glu Thr Val Thr He He Gly Leu Val Asn Ser Lys 

260 265 270 

Arg Glu Val Asn Lys Gly Leu He Phe Glu He Glu Asp Leu Thr 

275 280 285 

Gly Lys Val Lys Val Phe Leu Pro Lys Asp Ser Glu Asp Tyr Arg 

290 295 300 

Glu Ala Phe Lys Val Leu Pro Asp Ala Val Val Ala Phe Lys Gly 

305 310 315 

Val Tyr Ser Lys Arg Gly He Leu Tyr Ala Asn Lys Phe Tyr Leu 

320 325 330 

Pro Asp Val Pro Leu Tyr Arg Arg Gin Lys Pro Pro Leu Glu Glu 

335 340 345 

Lys Val Tyr Ala He Leu He Ser Asp He His Val Gly Ser Lys 

350 355 360 

Glu Phe Cys Glu Asn Ala Phe He Lys Phe Leu Glu Trp Leu Asn 

365 370 375 

Gly Asn Val Glu Thr Lys Glu Glu Glu Glu He Val Ser Arg Val 

380 385 390 

Lys Tyr Leu He He Ala Gly Asp Val Val Asp Gly Val Gly Val 

395 400 405 
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Tyr Pro Gly Gin Tyr Ala Asp Leu Thr lie Pro Asp lie Phe Asp 
410 415 420 

Gin Tyr Glu Ala Leu Ala Asn Leu Leu Ser His Val Pro Lys His 
425 430 435 

lie Thr Met Phe lie Ala Pro Gly Asn His Asp Ala Ala Arg Gin 
440 445 450 

Ala lie Pro Gin Pro Glu Phe Tyr Lys Glu Tyr Ala Lys Pro He 
455 460 465 

Tyr Lys Leu Lys Asn Ala Val He He Ser Asn Pro Ala Val He 
470 475 480 

Arg Leu His Gly Arg Asp Phe Leu He Ala His Gly Arg Gly He 
485 490 495 

Glu Asp Val Val Gly Ser Val Pro Gly Leu Thr His His Lys Pro 
500 505 510 

Gly Leu Pro Met Val Glu Leu Leu Lys Met Arg His Val Ala Pro 
515 520 525 

Met Phe Gly Gly Lys Val Pro He Ala Pro Asp Pro Glu Asp Leu 
530 535 540 

Leu Val He Glu Glu Val Pro Asp Val Val His Met Gly His Val 
545 550 555 

His Val Tyr Asp Ala Val Val Tyr Arg Gly Val Gin Leu Val Asn 
560 565 570 

Ser Ala Thr Trp Gin Ala Gin Thr Glu Phe Gin Lys Met Val Asn 
575 580 585 

He Val Pro Thr Pro Ala Lys Val Pro Val Val Asp He Asp Thr 
590 595 600 

Ala Lys Val Val Lys Val Leu Asp Phe Ser Gly Trp Cys 
605 610 



SEQ ID NO: 6 
SEQUENCE LENGTH: 1263 
SEQUENCE TYPE: amino acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
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MOLECULAR TYPE: peptide 
SEQUENCE DESCRIPTION: 

Met Glu Leu Pro Lys Glu He Glu Glu Tyr Phe Glu Met Leu Gin 

5 10 15 

Arg Glu He Asp Lys Ala Tyr Glu He Ala Lys Lys Ala Arg Ser 

20 25 30 

Gin Gly Lys Asp Pro Ser Thr Asp Val Glu He Pro Gin Ala Thr 

35 40 45 

Asp Met Ala Gly Arg Val Glu Ser Leu Val Gly Pro Pro Gly Val 
50 55 60 

Ala Gin Arg He Arg Glu Leu Leu Lys Glu Tyr Asp Lys Glu He 
65 70 75 

Val Ala Leu Lys He Val Asp Glu He He Glu Gly Lys Phe Gly 
80 85 90 

Asp Phe Gly Ser Lys Glu Lys Tyr Ala Glu Gin Ala Val Arg Thr 
95 100 105 

Ala Leu Ala He Leu Thr Glu Gly He Val Ser Ala Pro Leu Glu 

110 115 120 

Gly He Ala Asp Val Lys He Lys Arg Asn Thr Trp Ala Asp Asn 

125 130 135 

Ser Glu Tyr Leu Ala Leu Tyr Tyr Ala Gly Pro He Arg Ser Ser 

140 145 ISO 

Gly Gly Thr Ala Gin Ala Leu Ser Val Leu Val Gly Asp Tyr Val 

155 160 165 

Arg Arg Lys Leu Gly Leu Asp Arg Phe Lys Pro Ser Gly Lys His 

170 175 180 

He Glu Arg Met Val Glu Glu Val Asp Leu Tyr His Arg Ala Val 

185 190 195 

Ser Arg Leu Gin Tyr His Pro Ser Pro Asp Glu Val Arg Leu Ala 

200 205 210 

Met Arg Asn He Pro He Glu He Thr Gly Glu Ala Thr Asp Asp 

215 220 225 

Val Glu Val Ser His Arg Asp Val Glu Gly Val Glu Thr Asn Gin 

230 235 240 
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Leu Arg Gly 
Lys Ala Lys 
Gly Trp Glu 
Glu Glu He 
Glu Thr Arg 
Glu Lys Phe 
Glu He He 
Gly Gly Phe 
Ala Thr Trp 
Phe Leu Ala 
Gly Ala Val 
Lys Leu Lys 
Ala Leu Lys 
Asp Ala He 
Leu Leu Pro 
Val Lys Ala 
Glu Glu Asn 



Gly Ala He 

245 
Lys Leu Val 

260 
Trp Leu Lys 

275 
Glu Glu Ser 

290 
Val Glu Val 

305 
Arg Ala Glu 

320 
Gly Gly Arg 

335 
Arg Leu Arg 

350 
Ser He Asn 

365 
He Gly Thr 

380 
Val Thr Pro 

395 
Asp Gly Ser 

410 
He Arg Asp 

425 
He Ala Phe 

440 
Ala Asn Tyr 

455 
Val Asn Glu 

470 
Pro Arg Glu 

485 



Leu Val Leu 
Lys Tyr He 
Glu Phe Val 
Glu Ser Lys 
Glu Lys Gly 
He Ala Pro 
Pro Leu Phe 
Tyr Gly Arg 
Pro Ala Thr 
Gin Met Lys 
Ala Thr Thr 
Val Val Arg 
Glu Val Glu 
Gly Asp Phe 
Val Glu Glu 
Ala Tyr Glu 
Ser Val Glu 



Ala Glu Gly 
250 

Asp Lys Met 
265 

Glu Ala Lys 
280 

Ala Glu Glu 
295 

Phe Tyr Tyr 
310 

Ser Glu Lys 
325 

Ala Gly Pro 
340 

Ser Arg Val 
355 

Met Val Leu 
370 

Thr Glu Arg 
385 

Ala Glu Gly 
400 

Val Asp Asp 
415 

Glu He Leu 
430 

Val Glu Asn 
445 

Trp Trp He 
460 

Val Glu Leu 
475 

Glu Ala Ala 
490 



Val Leu Gin 
255 

Gly He Asp 
270 

Glu Lys Gly 
285 

Ser Lys Val 
300 

Lys Leu Tyr 
315 

Tyr Ala Lys 
330 

Ser Glu Asn 
345 

Ser Gly Phe 
360 

Val Asp Glu 
375 

Pro Gly Lys 
390 

Pro He Val 
405 

Tyr Asn Leu 
420 

Tyr Leu Gly 
435 

Asn Gin Thr 
450 

Gin Glu Phe 
465 

Arg Pro Phe 
480 

Glu Tyr Leu 
495 
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Glu Val Asp Pro Glu Phe Leu Ala Lys Met Leu Tyr Asp Pro Leu 
500 505 510 

Arg Val Lys Pro Pro Val Glu Leu Ala He His Phe Ser Glu He 
515 520 525 

Leu Glu He Pro Leu His Pro Tyr Tyr Thr Leu Tyr Trp Asn Thr 
530 535 540 

Val Asn Pro Lys Asp Val Glu Arg Leu Trp Gly Val Leu Lys Asp 
545 550 555 

Lys Ala Thr He Glu Trp Gly Thr Phe Arg Gly He Lys Phe Ala 
560 565 570 

Lys Lys He Glu He Ser Leu Asp Asp Leu Gly Ser Leu Lys Arg 
575 580 585 

Thr Leu Glu Leu Leu Gly Leu Pro His Thr Val Arg Glu Gly He 
590 595 600 

Val Val Val Asp Tyr Pro Trp Ser Ala Ala Leu Leu Thr Pro Leu 
605 610 615 

Gly Asn Leu Glu Trp Glu Phe Lys Ala Lys Pro Phe Tyr Thr Val 
620 625 630 

He Asp He He Asn Glu Asn Asn Gin He Lys Leu Arg Asp Arg 
635 640 645 

Gly He Ser Trp He Gly Ala Arg Met Gly Arg Pro Glu Lys Ala 
650 655 660 

Lys Glu Arg Lys Met Lys Pro Pro Val Gin Val Leu Phe Pro He 
665 670 675 

Gly Leu Ala Gly Gly Ser Ser Arg Asp He Lys Lys Ala Ala Glu 
680 685 690 

Glu Gly Lys He Ala Glu Val Glu He Ala Phe Phe Lys Cys Pro 
695 700 705 

Lys Cys Gly His Val Gly Pro Glu Thr Leu Cys Pro Glu Cys Gly 
710 715 720 

He Arg Lys Glu Leu He Trp Thr Cys Pro Lys Cys Gly Ala Glu 
725 730 735 

Tyr Thr Asn Ser Gin Ala Glu Gly Tyr Ser Tyr Ser Cys Pro Lys 
740 745 750 
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Cys Asn Val Lys Leu Lys Pro Phe Thr Lys Arg Lys lie Lys Pro 

755 760 765 

Ser Glu Leu Leu Asn Arg Ala Met Glu Asn Val Lys Val Tyr Gly 

770 775 780 

Val Asp Lys Leu Lys Gly Val Met Gly Met Thr Ser Gly Trp Lys 

785 790 795 

He Ala Glu Pro Leu Glu Lys Gly Leu Leu Arg Ala Lys Asn Glu 

800 805 810 

Val Tyr Val Phe Lys Asp Gly Thr lie Arg Phe Asp Ala Thr Asp 

815 820 825 

Ala Pro lie Thr His Phe Arg Pro Arg Glu lie Gly Val Ser Val 

830 835 840 

Glu Lys Leu Arg Glu Leu Gly Tyr Thr His Asp Phe Glu Gly Lys 

845 850 855 

Pro Leu Val Ser Glu Asp Gin He Val Glu Leu Lys Pro Gin Asp 

860 865 870 

Val lie Leu Ser Lys Glu Ala Gly Lys Tyr Leu Leu Arg Val Ala 

875 880 885 

Arg Phe Val Asp Asp Leu Leu Glu Lys Phe Tyr Gly Leu Pro Arg 

890 895 900 

Phe Tyr Asn Ala Glu Lys Met Glu Asp Leu He Gly His Leu Val 

905 910 915 

He Gly Leu Ala Pro His Thr Ser Ala Gly He Val Gly Arg He 

920 925 930 

He Gly Phe Val Asp Ala Leu Val Gly Tyr Ala His Pro Tyr Phe 

935 940 945 

His Ala Ala Lys Arg Arg Asn Cys Asp Gly Asp Glu Asp Ser Val 

950 955 960 

Met Leu Leu Leu Asp Ala Leu Leu Asn Phe Ser Arg Tyr Tyr Leu 
965 970 975 

Pro Glu Lys Arg Gly Gly Lys Met Asp Ala Pro Leu Val He Thr 
980 985 990 

Thr Arg Leu Asp Pro Arg Glu Val Asp Ser Glu Val His Asn Met 
995 1000 1005 
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Asp Val Val Arg Tyr Tyr Pro Leu Glu Phe Tyr Glu Ala Thr Tyr 

1010 1015 1020 

Glu Leu Lys Ser Pro Lys Glu Leu Val Arg Val He Glu Gly Val 

1025 1030 1035 

Glu Asp Arg Leu Gly Lys Pro Glu Met Tyr Tyr Gly He Lys Phe 

1040 1045 1050 

Thr His Asp Thr Asp Asp He Ala Leu Gly Pro Lys Met Ser Leu 

1055 1060 1065 

Tyr Lys Gin Leu Gly Asp Met Glu Glu Lys Val Lys Arg Gin Leu 

1070 1075 1080 

Thr Leu Ala Glu Arg He Arg Ala Val Asp Gin His Tyr Val Ala 

1085 1090 1095 

Glu Thr He Leu Asn Ser His Leu He Pro Asp Leu Arg Gly Asn 

1100 1105 mo 

Leu Arg Ser Phe Thr Arg Gin Glu Phe Arg Cys Val Lys Cys Asn 

1115 1120 H25 

Thr Lys Tyr Arg Arg Pro Pro Leu Asp Gly Lys Cys Pro Val Cys 

1130 1135 1140 

Gly Gly Lys He Val Leu Thr Val Ser Lys Gly Ala He Glu Lys 

1145 1150 H55 

Tyr Leu Gly Thr Ala Lys Met Leu Val Ala Asn Tyr Asn Val Lys 

1160 1165 1170 

Pro Tyr Thr Arg Gin Arg He Cys Leu Thr Glu Lys Asp He Asp 

1175 1180 1185 

Ser Leu Phe Glu Tyr Leu Phe Pro Glu Ala Gin Leu Thr Leu He 

1190 H95 1200 

Val Asp Pro Asn Asp He Cys Met Lys Met He Lys Glu Arg Thr 

1205 1210 1215 

Gly Glu Thr Val Gin Gly Gly Leu Leu Glu Asn Phe Asn Ser Ser 

1220 1225 1230 

Gly Asn Asn Gly Lys Lys He Glu Lys Lys Glu Lys Lys Ala Lys 

1235 1240 1245 

Glu Lys Pro Lys Lys Lys Lys Val He Ser Leu Asp Asp Phe Phe 

1250 1255 1260 
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Ser Lys Arg 

SEQ ID NO: 7 
SEQUENCE LENGTH: 20 
SEQUENCE TYPE: amino acid 
STRANDEDNESS: single 
TOPOLOGY: linear 
MOLECULAR TYPE: peptide 
SEQUENCE DESCRIPTION: 

Met Asp Lys Glu Gly Phe Leu Asn Lys Val Arg Glu Ala Val Asp 

5 10 15 

Val Val Lys Leu His 
20 

SEQ ID NO: 8 
SEQUENCE LENGTH: 20 
SEQUENCE TYPE: amino acid 
STRANDEDNESS: single 
TOPOLOGY: linear 
MOLECULAR TYPE: peptide 
SEQUENCE DESCRIPTION: 

Met Phe Thr Gly Lys Val Leu He Pro Val Lys Val Leu Lys Lys 

5 10 15 

Phe Glu Asn Trp Asn 
20 

SEQ ID NO: 9 
SEQUENCE LENGTH: 20 
SEQUENCE TYPE: amino acid 
STRANDEDNESS: single 
TOPOLOGY: linear 
MOLECULAR TYPE: peptide 
SEQUENCE DESCRIPTION: 

Met He Gly Ser He Phe Tyr Ser Lys Lys Phe Asn Leu His Arg 
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5 10 IS 

Pro Ser Glu Tyr His 
20 

SEQ ID NO: 10 
SEQUENCE LENGTH: 20 
SEQUENCE TYPE: amino acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
MOLECULAR TYPE: peptide 
SEQUENCE DESCRIPTION: 

Met Lys Asp Tyr Arg Pro Leu Leu Gly Ala lie Lys Val Lys Gly 

5 10 15 

Asp Asn Val Phe Ser 
20 

SEQ ID NO: 11 
SEQUENCE LENGTH: 18 
SEQUENCE TYPE: amino acid 
STRANDEDNESS: single 
TOPOLOGY: linear 
MOLECULAR TYPE: peptide 
SEQUENCE DESCRIPTION: 

Met Asp lie Glu Val Leu Arg Arg Leu Leu Glu Arg Glu Leu Ser 

5 10 15 

Ser Glu His 

SEQ ID NO: 12 
SEQUENCE LENGTH: 17 
SEQUENCE TYPE: amino acid 
STRANDEDNESS: single 
TOPOLOGY: linear 
MOLECULAR TYPE: peptide 
SEQUENCE DESCRIPTION: 
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Pro Phe Glu lie Val Phe Glu Gly Ala Lys Glu Phe Ala Gin Leu 

5 10 15 

lie Asp 

SEQ ID NO: 13 
SEQUENCE LENGTH: 17 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

ATGGATAARG ARGGNTT 17 

SEQ ID NO: 14 
SEQUENCE LENGTH: 20 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

AATAAAGTWA GRGARGCNGT 20 

SEQ ID NO: 15 
SEQUENCE LENGTH: 20 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

CTCTGCGGCA ATTCTTGCAA 20 

SEQ ID NO: 16 
SEQUENCE LENGTH: 20 
SEQUENCE TYPE: nucleic acid 
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STRANDEDNESS : single 
TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

CTTGCAAAGA AGTATGTAAC 20 

SEQ ID NO: 17 
SEQUENCE LENGTH: 2009 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: double 
TOPOLOGY: linear 
MOLECULAR TYPE: Genomic DNA 
SEQUENCE DESCRIPTION: 

AAGCTTCCAA AGAACTGGCG TTACGACCCA GAGACTGCAA AGTTGCTCGT CCGCTGATCC 60 
TTCCCTATAT TTTCATTTGG TGTTTTTCAT GGATAAGGAG GGTTTTTTGA ACAAGGTTAG 120 
GGAGGCTGTG GATGTAGTAA AGCTCCACAT CGAGTTAGGT CATACTATAA GGATAATCTC 180 
TCATAGGGAT GCGGATGGAA TAACCTCTGC GGCAATTCTT GCAAAGGCTT TGGGAAGAGA 240 
AGGAGCGAGC TTTCACATTT CGATTGTTAA ACAGGTAAGT GAAGATCTTT TAAGAGAATT 300 
AAAGGATGAA GATTACAAAA TCTTCATTTT TTCCGACCTG GGTAGTGGTT CTTTAAGTTT 360 
GATAAAAGAG TATCTTAAGG AAAAAACTGT TATAATCCTT GATCACCATC CTCCGGAAAA 420 
TGTGAAGTTG GAAGAAAAGC ATATACTTGT TAATCCAGTT CAATTTGGCG CAAATAGCGT 480 
TAGGGATCTG AGTGGATCTG GGGTTACATA CTTCTTTGCA AGGGAGCTAA ATGAAAAGAA 540 
TAGGGACCTT GCTTACATTG CAATAGTGGG AGCAGTTGGG GATATGCAAG AGAACGATGG 600 
AGTTTTCCAT GGGATGAACC TTGATATTAT TGAAGATGGG AAATCTCTGG GAATTCTTGA 660 
GGTTAAAAAA GAATTGCGCC TGTTTGGTAG GGAAACTAGA CCTCTCTATC AAATGCTCGC 720 
ATATGCCACA AATCCGGAAA TTCCTGAAGT TACTGGAGAC GAGAGGAAGG CCATAGAGTG 780 
GTTAAAGAAC AAGGGCTTCA ATCCCGAGAA AAAATATTGG GAATTAAGTG AGGAGGAAAA 840 
GAAAAAGTTA CATGATTTCC TAATCATTCA CATGATCAAG CATGGAGCTG GAAAAGAGGA 900 
TATAGATAGG CTAATAGGAG ACGTTGTTAT TAGTCCCTTA TATCCTGAAG GGGATCCCAG 960 
GCACGAGGCT AGAGAATTTG CTACCCTATT AAACGCTACA GGCAGGTTAA ACTTGGGCAA 1020 
CTTAGGAGTG GCTGTATGTT TGGGAGATGA GGAGGCTTTC AGAAAGGCCC TAAAGATGGT 1080 
TGAAGACTAC AAGAGGGAGC AAATTGAAGC AAGAAAGTGG CTACTTCAAA ATTGGAACAG 1140 
TGAAGTTTGG GAGGGGGATC ATGTTTACGT CTTATATGTG GGAAAGAGTA TTAGAGATAC 1200 
TCTCGTTGGA ATAGCAGCTA GCATGGCCAT CAATGCTGGA CTGGCAGATC CTGAAAAGCC 1260 
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GGTTATAGTG TTTGCAGATA CTGATGAAGA TCCAAACCTT CTCAAAGGTT CAGCTAGAAC 1320 
AACTGAAAGG GCTTTAGCTA AGGGTTACAA TTTGGGAGAA GCTCTTAGGA AAGCGGCTGA 1380 

5 GCTAGTGAAT GGGGAAGGGG GAGGACACGC GATAGCTGCA GGTATAAGAA TTCCCAGGGC 1440 

CAGGTTGGCG GAGTTTAGAA AATTAATAGA TAAAATCCTT GGAGAACAGG TGAGCAAAGG 1500 
TGGAGATAAA AGCGAAAGCT GAAATATTGT GGGAGTACAG CGATGAGAAG GTTGCTGAGG 1560 
CTATTGCGAA GTCTGTTGAT GTTGATAATA TTTCTCTCCC TCCAAACCTC AAGAAAAGTT 1620 

10 TAAATCTTAT GACGTTTTCC GATGGAGCGA AGGTAATAAC AAAGGTTAAA TATCATGGAG 1680 

AAATTGAGAC TCTCATAGTT GCTCTCGATG ATTTGATATT CGCTGTAAAA GTTGCTGAGG 1740 
AGGTGTTATG ATGGTGNGAA AAGGGNAACA ACAACANGGG ATAAGGGAAG NTGAAGCAAT 1800 
GGTATATTAT TTATGCTCCN GANTTCTTGG GCGGGGTAGA GGTAGGATTA ACGCCAGCAG 1860 

15 

ACGATCCAGA GAAAGTACTC AACAGAGTCG TTGAAGTTAC TCTGAAGGAT GTTACAGGAG 1920 
ACTTTACAAA GAGTCACGTG AAGCTCTATT TCCAAGTATA TGATGTCAAG GGACAGAATG 1980 
CCTACACAAA GTTCAAGGGA ATGAAGCTT 2009 

20 

SEQ ID NO: 18 
SEQUENCE LENGTH: 1434 
SEQUENCE TYPE: nucleic acid 
25 STRAND EDNESS : double 

TOPOLOGY: linear 

MOLECULAR TYPE: Genomic DNA 

SEQUENCE DESCRIPTION: 

30 ATGGATAAGG AGGGTTTTTT GAACAAGGTT AGGGAGGCTG TGGATGTAGT AAAGCTCCAC 60 

ATCGAGTTAG GTCATACTAT AAGGATAATC TCTCATAGGG ATGCGGATGG AATAACCTCT 120 
GCGGCAATTC TTGCAAAGGC TTTGGGAAGA GAAGGAGCGA GCTTTCACAT TTCGATTGTT 180 
AAACAGGTAA GTGAAGATCT TTTAAGAGAA TTAAAGGATG AAGATTACAA AATCTTCATT 240 

35 TTTTCCGACC TGGGTAGTGG TTCTTTAAGT TTGATAAAAG AGTATCTTAA GGAAAAAACT 300 

GTTATAATCC TTGATCACCA TCCTCCGGAA AATGTGAAGT TGGAAGAAAA GCATATACTT 360 
GTTAATCCAG TTCAATTTGG CGCAAATAGC GTTAGGGATC TGAGTGGATC TGGGGTTACA 420 
TACTTCTTTG CAAGGGAGCT AAATGAAAAG AATAGGGACC TTGCTTACAT TGCAATAGTG 480 

40 GGAGCAGTTG GGGATATGCA AGAGAACGAT GGAGTTTTCC ATGGGATGAA CCTTGATATT 540 

ATTGAAGATG GGAAATCTCT GGGAATTCTT GAGGTTAAAA AAGAATTGCG CCTGTTTGGT 600 
AGGGAAACTA GACCTCTCTA TCAAATGCTC GCATATGCCA CAAATCCGGA AATTCCTGAA 660 
GTTACTGGAG ACGAGAGGAA GGCCATAGAG TGGTTAAAGA ACAAGGGCTT CAATCCCGAG 720 

45 

AAAAAATATT GGGAATTAAG TGAGGAGGAA AAGAAAAAGT TACATGATTT CCTAATCATT 780 
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CACATGATCA AGCATGGAGC TGGAAAAGAG GATATAGATA GGCTAATAGG AGACGTTGTT 840 
ATTAGTCCCT TATATCCTGA AGGGGATCCC AGGCACGAGG CTAGAGAATT TGCTACCCTA 900 
TTAAACGCTA CAGGCAGGTT AAACTTGGGC AACTTAGGAG TGGCTGTATG TTTGGGAGAT 960 
GAGGAGGCTT TCAGAAAGGC CCTAAAGATG GTTGAAGACT ACAAGAGGGA GCAAATTGAA 1020 
GCAAGAAAGT GGCTACTTCA AAATTGGAAC AGTGAAGTTT GGGAGGGGGA TCATGTTTAC 1080 
GTCTTATATG TGGGAAAGAG TATTAGAGAT ACTCTCGTTG GAATAGCAGC TAGCATGGCC 1140 
ATCAATGCTG GACTGGCAGA TCCTGAAAAG CCGGTTATAG TGTTTGCAGA TACTGATGAA 1200 
GATCCAAACC TTCTCAAAGG TTCAGCTAGA ACAACTGAAA GGGCTTTAGC TAAGGGTTAC 1260 
AATTTGGGAG AAGCTCTTAG GAAAGCGGCT GAGCTAGTGA ATGGGGAAGG GGGAGGACAC 1320 
GCGATAGCTG CAGGTATAAG AATTCCCAGG GCCAGGTTGG CGGAGTTTAG AAAATTAATA 1380 
GATAAAATCC TTGGAGAACA GGTGAGCAAA GGTGGAGATA AAAGCGAAAG CTGA 1434 



25 



30 



SEQ ID NO: 19 
20 SEQUENCE LENGTH: 477 

SEQUENCE TYPE: amino acid 
STRANDEDNESS: single 
TOPOLOGY: linear 
MOLECULAR TYPE: peptide 
SEQUENCE DESCRIPTION: 

Met Asp Lys Glu Gly Phe Leu Asn Lys Val Arg Glu Ala Val Asp 

5 10 15 

Val Val Lys Leu His lie Glu Leu Gly His Thr He Arg He He 

20 25 30 

Ser His Arg Asp Ala Asp Gly He Thr Ser Ala Ala He Leu Ala 

35 35 40 45 

Lys Ala Leu Gly Arg Glu Gly Ala Ser Phe His He Ser He Val 

50 55 60 

Lys Gin Val Ser Glu Asp Leu Leu Arg Glu Leu Lys Asp Glu Asp 

«> 65 70 75 
Tyr Lys He Phe He Phe Ser Asp Leu Gly Ser Gly Ser Leu Ser 

80 85 90 
Leu He Lys Glu Tyr Leu Lys Glu Lys Thr Val He He Leu Asp 

95 100 105 
His His Pro Pro Glu Asn Val Lys Leu Glu Glu Lys His He Leu 
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Val Asn Pro 
Gly Ser Gly 
Asn Arg Asp 
Met Gin Glu 
lie Glu Asp 
Leu Arg Leu 
Ala Tyr Ala 
Arg Lys Ala 
Lys Lys Tyr 
Asp Phe Leu 
Asp lie Asp 
Pro Glu Gly 
Leu Asn Ala 
Val Cys Leu 
Val Glu Asp 
Leu Gin Asn 
Val Leu Tyr 



110 
Val Gin Phe 

125 
Val Thr Tyr 

140 
Leu Ala Tyr 

155 
Asn Asp Gly 

170 
Gly Lys Ser 

185 
Phe Gly Arg 

200 
Thr Asn Pro 

215 
lie Glu Txp 

230 
Trp Glu Leu 

245 
He He His 

260 
Arg Leu He 

275 
Asp Pro Arg 

290 
Thr Gly Arg 

305 
Gly Asp Glu 

320 
Tyr Lys Arg 

335 
Trp Asn Ser 

350 
Val Gly Lys 



Gly Ala Asn 
Phe Phe Ala 
He Ala He 
Val Phe His 
Leu Gly He 
Glu Thr Arg 
Glu He Pro 
Leu Lys Asn 
Ser Glu Glu 
Met He Lys 
Gly Asp Val 
His Glu Ala 
Leu Asn Leu 
Glu Ala Phe 
Glu Gin lie 
Glu Val Trp 
Ser He Arg 



115 

Ser Val Arg 
130 

Arg Glu Leu 
145 

Val Gly Ala 
160 

Gly Met Asn 
175 

Leu Glu Val 
190 

Pro Leu Tyr 
205 

Glu Val Thr 
220 

Lys Gly Phe 
235 

Glu Lys Lys 
250 

His Gly Ala 
265 

Val He Ser 
280 

Arg Glu Phe 
295 

Gly Asn Leu 
310 

Arg Lys Ala 
325 

Glu Ala Arg 
340 

Glu Gly Asp 
355 

Asp Thr Leu 



120 

Asp Leu Ser 
135 

Asn Glu Lys 
150 

Val Gly Asp 
165 

Leu Asp He 
180 

Lys Lys Glu 
195 

Gin Met Leu 
210 

Gly Asp Glu 
225 

Asn Pro Glu 
240 

Lys Leu His 
255 

Gly Lys Glu 
270 

Pro Leu Tyr 
285 

Ala Thr Leu 
300 

Gly Val Ala 
315 

Leu Lys Met 
330 

Lys Trp Leu 
345 

His Val Tyr 
360 

Val Gly He 
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365 370 375 

Ala Ala Ser Met Ala lie Asn Ala Gly Leu Ala Asp Pro Glu Lys 

380 385 390 

Pro Val He Val Phe Ala Asp Thr Asp Glu Asp Pro Asn Leu Leu 

395 400 405 

Lys Gly Ser Ala Arg Thr Thr Glu Arg Ala Leu Ala Lys Gly Tyr 

410 415 420 

Asn Leu Gly Glu Ala Leu Arg Lys Ala Ala Glu Leu Val Asn Gly 

425 430 435 

Glu Gly Gly Gly His Ala He Ala Ala Gly He Arg He Pro Arg 

440 445 450 

Ala Arg Leu Ala Glu Phe Arg Lys Leu He Asp Lys He Leu Gly 

455 460 465 

Glu Gin Val Ser Lys Gly Gly Asp Lys Ser Glu Ser 

470 475 

SEQ ID NO: 20 
SEQUENCE LENGTH: 31 
SEQUENCE TYPE: nucleic acid 
STRAND EDNESS : single 
TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

TTCATTTGGT GTTTTCCATG GATAAGGAGG G 31 

SEQ ID NO: 21 
SEQUENCE LENGTH: 23 
SEQUENCE TYPE: nucleic acid 
STRAND EDNESS : single 
TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

AAAGTWYTAA TWCCWGTNAA RGT 23 
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SEQ ID NO: 22 
SEQUENCE LENGTH: 23 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS : single 
TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

AAAGTWYTAA AAAARTTYGA RAA 23 

SEQ ID NO: 23 
SEQUENCE LENGTH: 20 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION : 

GATACTGCTA GAAGATTGGA 20 

SEQ ID NO: 24 
SEQUENCE LENGTH: 20 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

TTCGTACAGT CCCTCTGGTA 20 

SEQ ID NO: 25 
SEQUENCE LENGTH: 957 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: double 
TOPOLOGY: linear 
MOLECULAR TYPE: Genomic DNA 
SEQUENCE DESCRIPTION: 
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TO 



15 



20 



CTACGAA6CT AAAATTTGAT <?TCTCAACTC AAGGACTTTT AGCTTATAAA ATGTGTCAAG 60 
TCTTCCCCGA ACTTTCTCCT CCAGTAAGGG TTTTGTACCT CTCAGCAAAG ACAGGAGTAG 120 
GATTTGAAGA CCTTGAAACT TTAGCGTATG AACATTATTG TACATGCGGC GACCTCACTT 180 
AGATTTTTTA ACCCCTATTT TCTCTAATGT CATTCAAGTA TTGGGGGAGT AATCATGTTC 240 
ACGGGTAAGG TATTGATTCC AGTAAAAGTA CTCAAGAAGT TTGAGAATTG GAATGAAGGA 300 
GATATGATAC TGCTAGAAGA TTGGAAAGCC AAGGAATTGT GGGAGAGTGG AGTAGTTGAA 360 
ATAATCGATG AAGCTGATAA AGTCATAGGA GAGATCGATA GAGTGTTATC AGAAGAAAAG 420 
AAAAACCTCC CATTGACTCC AATACCAGAG GGACTGTACG AAAAAGCTGA ATTTTACATC 480 
TATTATCTAG AAAAGTACAT CCAAGAGAAG GTCGACAACA TAGAAACAAT ACAAACTAAG 540 
GTCACAAAGT TAGCAAATCT AAAGAAGAAG TATAAGACTC TGAAAGAGAT AAGATTTAAA 600 
AAGATACTAG AGGCTGTGAG GCTTAGACCA AACAGTATGG AAATTCTAGC GAGATTATCC 660 
CCAGCTGAAA AGAGAATATA CCTTGAGATC TCTAAAATAA GGAGAGAGTG GATAGGTGAT 720 
TAGCGTGGAC AGGGAGGAGA TGATTGAGAG ATTTGCAAAC TTCCTTAGGG AGTATACAGA 780 
CGAAGATGGT AACCCAGTAT ACAGAGGTAA AATAACTGAT TTACTTACAA TAACACCCAA 840 
GAGGTCTGTT GCAATAGACT GGATGCACCT AAATTCCTTT GACTCAGAGC TAGAGTCGAC 900 
CTGCAGGCAT GCATGCAGGT CGACTCTAGA GGATCCCCGG GTACCGAGCT CGAATTC 957 

25 SEQ ID NO: 26 

SEQUENCE LENGTH: 489 

SEQUENCE TYPE: nucleic acid 

STRAND ED NESS : double 
30 TOPOLOGY: linear 

MOLECULAR TYPE: Genomic DNA 

SEQUENCE DESCRIPTION: 

ATGTTCACGG GTAAGGTATT GATTCCAGTA AAAGTACTCA AGAAGTTTGA GAATTGGAAT 60 
35 GAAGGAGATA TGATACTGCT AGAAGATTGG AAAGCCAAGG AATTGTGGGA GAGTGGAGTA 120 

GTTGAAATAA TCGATGAAGC TGATAAAGTC ATAGGAGAGA TCGATAGAGT GTTATCAGAA 180 
GAAAAGAAAA ACCTCCCATT GACTCCAATA CCAGAGGGAC TGTACGAAAA AGCTGAATTT 240 
TACATCTATT ATCTAGAAAA GTACATCCAA GAGAAGGTCG ACAACATAGA AACAATACAA 300 
ACTAAGGTCA CAAAGTTAGC AAATCTAAAG AAGAAGTATA AGACTCTGAA AGAGATAAGA 360 
TTTAAAAAGA TACTAGAGGC TGTGAGGCTT AGACCAAACA GTATGGAAAT TCTAGCGAGA 420 
TTATCCCCAG CTGAAAAGAG AATATACCTT GAGATCTCTA AAATAAGGAG AGAGTGGATA 480 
45 GGTGATTAG 489 
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SEQ ID NO: 27 
SEQUENCE LENGTH: 162 
SEQUENCE TYPE: amino acid 
STRANDEDNESS: single 
TOPOLOGY: linear 
MOLECULAR TYPE: peptide 
SEQUENCE DESCRIPTION: 

Met Phe Thr Gly Lys Val Leu lie Pro Val Lys Val Leu Lys Lys 

5 10 15 

Phe Glu Asn Trp Asn Glu Gly Asp Met lie Leu Leu Glu Asp Trp 
20 25 30 

Lys Ala Lys Glu Leu Trp Glu Ser Gly Val Val Glu lie lie Asp 
35 40 45 

Glu Ala Asp Lys Val lie Gly Glu lie Asp Arg Val Leu Ser Glu 

50 55 60 

Glu Lys Lys Asn Leu Pro Leu Thr Pro He Pro Glu Gly Leu Tyr 

65 70 75 

Glu Lys Ala Glu Phe Tyr He Tyr Tyr Leu Glu Lys Tyr He Gin 

80 85 90 

Glu Lys Val Asp Asn He Glu Thr He Gin Thr Lys Val Thr Lys 
95 100 105 

Leu Ala Asn Leu Lys Lys Lys Tyr Lys Thr Leu Lys Glu He Arg 
110 115 120 

Phe Lys Lys He Leu Glu Ala Val Arg Leu Arg Pro Asn Ser Met 
125 130 135 

Glu He Leu Ala Arg Leu Ser Pro Ala Glu Lys Arg He Tyr Leu 
140 145 150 

Glu He Ser Lys He Arg Arg Glu Trp He Gly Asp 
155 160 



SEQ ID NO: 28 
SEQUENCE LENGTH: 17 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
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TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

ATGAAAGAYT AYAGRCC 17 

SEQ ID NO: 29 
SEQUENCE LENGTH: 20 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS : single 
TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

CAAGCWATWA ARGTNAAGGG 20 

SEQ ID NO: 30 
SEQUENCE LENGTH: 20 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

TTCAAGTAAG AGTGAGTTAG 20 

SEQ ID NO: 31 
SEQUENCE LENGTH: 20 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

TAAGTACTCC ACCATTTCCC 20 



SEQ ID NO: 32 
SEQUENCE LENGTH: 1012 
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SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: double 
5 TOPOLOGY : linear 

MOLECULAR TYPE: Genomic DNA 
SEQUENCE DESCRIPTION: 

TCTAGAACAT AGCAGTAAAA CTTTCCTTCT AGTACAACTT CTTCTCCTCT GTAAACTTTC 60 
10 ACATCAACTA TCTTCTTTCT CCCTTGATCC TCCACCACCT GAGCTTTTGC TAAAAGCACG 120 

TCTCCAACTT TCACCGGCTT TGTAAAGCGT ACCTCTGCCT TTCCAAGAAC TACAGTAGGC 180 
TCATTTACAG CAAGCATTGC GGCGTAATCA GCTAAACCAA ATGTAAAGCC CCCGTGAACT 240 
AGCCCCTTCT CATCAACCTT CATCTCGTCA ATGGTTTTCA GTTCCACTTC AGCATACCCC 300 
15 TCTCTTATTA CCCTGGGTTT TCCTACAAGT CTCTCAGATG TCAGATTGTG CGTTTTCTGC 360 

TCCATACCAC CACCGAAAAG AATAAGGTTT TTGAAATTTA AAAGCTAAGG GAGGAGTGAT 420 
GAAAGACTAT AGGCCACTCC TCCAAGCAAT AAAAGTTAAG GGAGATAATG TTTTTTCAAG 480 
TAAGAGTGAG TTAGTTGGTA TTCTAGCCTT TAATTTGGGA ATATTAACAG TTGGTGAGGC 540 
20 AAAAGAACTC ATAGAGGAGG CCATAAAGGA GGGAATCATT GAGGAAACTC CCGAAGGTCT 600 

CATAGTTCAT GAGGATGCCA TAACTGAAAA GGAAAGCAAA AGGGATATAT TCGGGGAAAT 660 
GGTGGAGTAC TTAGCGAGAG AACTTGAGCT TAGCGAGATA GAAGTTCTTG AAGAGATAGA 720 
AAAAATGAAA GAGAGGTACG GAAATTTGGA TAAAAAAATT CTTGCTTACT TATTCGGACT 780 

25 

ATCAAAAGGA GTTAACATGG AGAAATTCAA AGAATACTTG GAGGATGAAT GATGCCCAAA 840 
ATAGAACCTT TTGAAAAGTA CACTGAGAGA TACGAGGAGT GGTTTGAAAG AATAAATTTG 900 
CATACCTCAG TGAGCTTAAT GCCCTGAAAT CTCTTCTTCC TACCAGAGAA TGTGTTGAAG 960 
30 TGGGAATAGG TAGTGGAAGG TTTGCGGCTC CCCTGGGAAT TAAGATGGGG GT 1012 



SEQ ID NO: 33 
SEQUENCE LENGTH: 414 

35 SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: double 
TOPOLOGY: linear 
MOLECULAR TYPE: Genomic DNA 

40 SEQUENCE DESCRIPTION: 

ATGAAAGACT ATAGGCCACT CCTCCAAGCA ATAAAAGTTA AGGGAGATAA TGTTTTTTCA 60 
AGTAAGAGTG AGTTAGTTGG TATTCTAGCC TTTAATTTGG GAATATTAAC AGTTGGTGAG 120 
GCAAAAGAAC TCATAGAGGA GGCCATAAAG GAGGGAATCA TTGAGGAAAC TCCCGAAGGT 180 
CTCATAGTTC ATGAGGATGC CATAACTGAA AAGGAAAGCA AAAGGGATAT ATTCGGGGAA 240 
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10 



15 



ATGGTGGAGT ACTTAGCGAG AGAACTTGAG CTTAGCGAGA TAGAAGTTCT TGAAGAGATA 300 
GAAAAAATGA AAGAGAGGTA CGGAAATTTG GATAAAAAAA TTCTTGCTTA CTTATTCGGA 360 
5 CTATCAAAAG GAGTTAACAT GGAGAAATTC AAAGAATACT TGGAGGATGA ATGA 414 

SEQ ID NO: 34 
SEQUENCE LENGTH: 137 
SEQUENCE TYPE? amino acid 
STRANDEDNESS: single 
TOPOLOGY: linear 
MOLECULAR TYPE: peptide 
SEQUENCE DESCRIPTION: 

Met Lys Asp Tyr Arg Pro Leu Leu Gin Ale lie Lys Val Lys Gly 
5 10 15 

20 Asp Asn Val Phe Ser Ser Lys Ser Glu Leu Val Gly lie Leu Ala 

20 25 30 

Phe Asn Leu Gly lie Leu Thr Val Gly Glu Ala Lys Glu Leu lie 
35 40 45 

Glu Glu Ala lie Lys Glu Gly He He Glu Glu Thr Pro Glu Gly 
50 55 60 

Leu He Val His Glu Asp Ala He Thr Glu Lys Glu Ser Lys Arg 
65 70 75 

Asp He Phe Gly Glu Met Val Glu Tyr Leu Ala Arg Glu Leu Glu 
80 85 90 

Leu Ser Glu He Glu Val Leu Glu Glu He Glu Lys Met Lys Glu 
35 95 100 105 

Arg Tyr Gly Asn Leu Asp Lys Lys He Leu Ala Tyr Leu Phe Gly 
110 115 120 

Leu Ser Lys Gly Val Asn Met Glu Lys Phe Lys Glu Tyr Leu Glu 
40 125 130 135 

Asp Glu 



25 



30 



45 



50 



55 



SEQ ID NO: 35 
SEQUENCE LENGTH: 33 
SEQUENCE TYPE: nucleic acid 
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STRAND EDNESS : single 
TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

AAAGCTAAGG GAGGACATAT GAAAGACTAT AGG 33 

SEQ ID NO: 36 
SEQUENCE LENGTH: 35 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY : linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

TCAAACCACT CCTCGAATTC CTCAGTGTAC TTTTC 35 

SEQ ID NO: 37 
SEQUENCE LENGTH: 20 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

CCWTTYGARA TWGTWTTYGA 20 

SEQ ID NO: 38 
SEQUENCE LENGTH: 20 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

GGWGCWAARG ARTTYGCNCA 20 
SEQ ID NO: 39 
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SEQUENCE LENGTH: 20 
SEQUENCE TYPE: nucleic acid 
5 STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

10 

AACTTATAGA CACCGCAAGT 20 

SEQ ID NO: 40 
15 SEQUENCE LENGTH: 20 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 
20 MOLECULAR TYPE: other nucleic acid (synthetic DNA) 

SEQUENCE DESCRIPTION: 

GTCACTCTTC AACTCTTGGA 20 

25 SEQ ID NO: 41 

SEQUENCE LENGTH: 989 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: double 

30 TOPOLOGY: linear 

MOLECULAR TYPE: Genomic DNA 
SEQUENCE DESCRIPTION: 

^ AAGCTTATAA AAGAATACCC GATACAGACA ATGGAAAAAC TTATTTATTG AGGGGTAAAG 60 

AAAGAGTTAG GCTTATGCTA AACATTCTTA AGGAGGTGGA AAGAGATGCC ATTTGAAATC 120 
GTATTTGAAG GTGCAAAAGA GTTTGCCCAA CTTATAGACA CCGCAAGTAA GTTAATAGAT 180 
GAGGCCGCGT TTAAAGTTAC AGAAGATGGG ATAAGCATGA GGGCCATGGA TCCAAGTAGA 240 

40 GTTGTCCTGA TTGACCTAAA TCTCCCGTCA AGCATATTTA GCAAATATGA AGTTGTTGAA 300 

CCAGAAACAA TTGGAGTTAA CATGGACCAC CTAAAGAAGA TCCTAAAGAG AGGTAAAGCA 360 
AAGGACACCT TAATACTCAA GAAAGGAGAG GAAAACTTCT TAGAGATAAC AATTCAAGGA 420 
ACTGCAACAA GAACATTTAG AGTTCCCCTA ATAGATGTAG AAGAGATGGA AGTTGACCTC 480 

45 CCAGAACTTC CATTCACTGC AAAGGTTGTA GTTCTTGGAG AAGTCCTAAA AGATGCTGTT 540 

AAAGATGCCT CTCTAGTGAG TGACAGCATA AAATTTATTG CCAGGGAAAA TGAATTTATA 600 
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ATGAAGGCAG AGGGAGAAAC CCAGGAAGTT GAGATAAAGC TAACTCTTGA AGATGAGGGA 660 

TTATTGGACA TCGAGGTTCA AGAGGAGACA AAGAGCGCAT ATGGAGTCAG CTATCTCTCC 720 

GACATGGTTA AAGGACTTGG AAAGGOCGAT GAAGTTACAA TAAAGTTTGG AAATGAAATG 780 

CCCATGCAAA TGGAGTATTA CATTAGAGAT GAAGGAAGAC TTACATTCCT ACTGGCTCCA 840 

AGAGTTGAAG AGTGACTTTT CTTTTCCTTA TAATTTAATT TGGGGATAAC AATGGATATT 900 

GAGGTTCTCA GAAGATTATT GGAGAGAGAA CTTTCAAGCG AAGAACTGAC TAAAATAGAG 960 

GAAGAATTTT ATGACGATTT AGAAAGCTT 989 



SEQ ID NO: 42 
75 SEQUENCE LENGTH: 45 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 
20 MOLECULAR TYPE: other nucleic acid (synthetic DNA) 

SEQUENCE DESCRIPTION: 

CCGGAACCGC CTCCCTCAGA GCCGCCACCC TCAGAACCGC CACCC 45 



SEQ ID NO: 43 
SEQUENCE LENGTH: 23 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION : 

CCWTGGGTWG ARAARTAYAG RCC 23 

SEQ ID NO: 44 
SEQUENCE LENGTH: 20 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

WSWGATGAAA GAGGNATHGA 20 
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SEQ ID NO: 45 
SEQUENCE LENGTH: 20 
SEQUENCE TYPE: nucleic acid 
STRAND EDNESS : single 
TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

GCWTTWAGAA GAACNATGGA 20 

SEQ ID NO: 46 
SEQUENCE LENGTH: 20 
SEQUENCE TYPE: nucleic acid 
STRAND EDNESS : single 
TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

TTWCCWACWC CWGGWGGNCC 20 

SEQ ID NO: 47 
SEQUENCE LENGTH: 20 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

CTTCTTAAWG CATTYTGNGC 20 

SEQ ID NO: 48 
SEQUENCE LENGTH: 23 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 
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ATWATTTTWS WWGGATARTT RCA 23 



SEQ ID NO: 49 
SEQUENCE LENGTH: 20 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

ATWGCTTTTC TCATRTCNCC 20 

SEQ ID NO: 50 
SEQUENCE LENGTH: 20 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 

SEQUENCE DESCRIPTION: 

ATCTTGAGTT AAAGCGTCGG 20 

SEQ ID NO: 51 
SEQUENCE LENGTH: 20 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

ACGTTCGCTT TATCTTGAGC 20 

SEQ ID NO: 52 
SEQUENCE LENGTH: 20 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 
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MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

TCAAAGACTT GACGACATTG 20 

SEQ ID NO: 53 
SEQUENCE LENGTH: 20 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: Single 
TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

TTCTGCTATG TAAAGTATTG 20 

SEQ ID NO: 54 
SEQUENCE LENGTH: 20 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

CAATACTTTA CATAGCAGAA 20 

SEQ ID NO: 55 
SEQUENCE LENGTH: 3620 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: double 
TOPOLOGY: linear 
MOLECULAR TYPE: Genomic DNA 
SEQUENCE DESCRIPTION: 

GAGCTCCAGC AACAACAATA ACCCAAGATG GAAAGGACTT TGGAGTAAGG TACTTTGGAT 60 
TACCGGCAGG TCATGAGTTC GCAGCATTCT TAGAGGACAT TGTGGATGTT AGTAGAGAAG 120 
AAACAAACCT TATGGACGAG ACAAAACAGG CCATCAGAAA CATAGACCAG GATGTAAGAA 180 
TATTGGTGTT TGAAACTCCA ACATGCCCAT ACTGTCCACT TGCCGTTAGA ATGGCTCACA 240 
AGTTTGCCAT TGAAAACACA AAAGCTGGGA AAGGTAAGAT ACTTGGGGAT ATGGTCGAGG 300 
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CCATTGAGTA TCCAGAGTG6 GCTGACCAGT ACAATGTAAT GGCAGTACCA AAAATTGTTA 360 
TTCAGGTCAA CGGAGAAGAC AGAGTAGAAT TTGAAGGAGC TTATCCAGAG AAAATGTTCT 420 

5 TAGAGAAGTT ACTCTCAGCT CTCAGCTGAT CTACTGTTTT TCCTTCTTTT CTTCTGTTCT 480 

GTTATTGCCT AGGATAAGCT TAATAATACT TTGATACCTT TCTTAGTTTA GGTGTOTGAG 540 
AGTATGAGCG AAGAGATTAG AGAAGTTAAG GTTCTAGAAA AACCCTGGGT TGAGAAGTAT 600 
AGACCTCAAA GACTTGACGA CATTGTAGGA CAAGAGCACA TAGTGAAAAG GCTCAAGCAC 660 

10 TACGTCAAAA CTGGATCAAT GCCCCACCTA CTCTTCGCAG GCCCCCCTGG TGTCGGAAAG 720 

TGTCTTACTG GAGATACCAA AGTTATAGCT AATGGCCAAC TCTTTGAACT TGGAGAACTT 780 
GTTGAAAAGC TTTCTGGGGG GAGATTTGGA CCAACTCCAG TTAAAGGGCT CAAAGTTCTT 840 
GGAATAGATG AGGATGGAAA GCTTAGAGAG TTTGAAGTCC AATACGTCTA CAAAGATAGA 900 

15 

ACTGATAGGT TGATAAAGAT AAAAACTCAC CTTGGCAGGG AGCTTAAAGT AACTCCGTAT 960 
CACCCACTTC TAGTGAATAG AGAGAATGGC GAAATAAAGT GGATTAAGGC TGAAGAACTC 1020 
AAACCTGGCG ACAAGCTTGC AATACCGAGC TTTCTCCCAC TTATAACTGG AGAAAATCCC 1080 

2Q CTTGCAGAGT GGCTTGGTTA CTTTATGGGA AGTGGCTATG CTTATCCAAG TAATTCTGTC 1140 

ATCACGTTCA CTAACGAAGA TCCACTCATA AGACAACGCT TTATGGAACT AACAGAGAAA 1200 
CTTTTCCCTG ATGCAAAGAT AAGGGAAAGA ATTCACGCTG ATGGAACTCC AGAAGTTTAT 1260 
GTGGTATCTA GGAAAGCTTG GAGCCTTGTA AACTCTATTA GCTTAACATT AATACCCAGG 1320 

25 GAGGGGTGGA AAGGAATTCG TTCTTTCCTT AGGGCATATT CCGACTGCAA TGGTCGGATT 1380 

GAAAGTGATG CAATAGTTTT ATCAACCGAT AACAATGATA TGGCCCAGCA GATAGCCTAT 1440 
GCTTTAGCCA GCTTTGGAAT AATAGCTAAA ATGGATGGAG AAGATGTTAT TATCTCAGGC 1500 
TCGGACAACA TAGAGAGGTT CCTAAATGAG ATTGGCTTTA GCACCCAAAG CAAACTTAAA 1560 

30 GAAGCCCAGA AGCTCATTAG AAAAACCAAT GTAAGATCCG ATGGACTAAA GATTAACTAT 1620 

GAGCTAATCT CCTATGTAAA AGACAGGCTT AGGTTAAATG TCAATGATAA AAGAAATTTG 1680 
AGCTACAGAA ATGCAAAGGA GCTTTCTTGG GAACTCATGA AAGAAATTTA TTATCGCCTT 1740 
GAGGAACTGG AGAGACTAAA GAAGGTCTTA TCAGAACCCA TCTTGATCGA CTGGAATGAA 1800 
5 GTAGCAAAGA AGAGTGATGA AGTAATAGAA AAAGCTAAAA TTAGAGCAGA GAAGCTCCTA 1860 

GAATACATAA AAGGAGAGAG AAAGCCAAGT TTCAAGGAGT ACATTGAGAT AGCAAAAGTC 1920 
CTTGGAATTA ACGTTGAACG TACCATCGAA GCTATGAAGA TCTTTGCAAA GAGATACTCA 1980 
AGCTATGCCG AGATTGGAAG AAAACTTGGA ACTTGGAATT TCAATGTAAA AACAATTCTT 2040 

40 

GAGAGCGAGA CAGTGGATAA CGTTGAAATC CTTGAAAAGA TAAGGAAAAT TGAGCTTGAG 2100 
CTCATAGAGG AAATTCTTTC GGATGGAAAG CTCAAAGAAG GTATAGCATA TCTCATTTTC 2160 
CTCTTCCAGA ATGAGCTTTA CTGGGACGAG ATAACTGAAG TAAAAGAGCT TAGGGGAGAC 2220 
45 TTTATAATCT ATGATCTTCA TGTTCCTGGC TACCACAACT TTATTGCTGG GAACATGCCA 2280 

ACAGTAGTCC ATAACACTAC AGCGGCTTTG GCCCTTGCAA GAGAGCTTTT CGGCGAAAAC 2340 
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TGGAGGCATA ACTTCCTCGA GTTGAATGCT TCAGATGAAA GAGGTATAAA CGTAATTAGA 2400 
GAGAAAGTTA AGGAGTTTGC GAGAACAAAG CCTATAGGAG GAGCAAGCTT CAAGATAATT 2460 

5 TTCCTTGATG AGGCCGACGC TTTAACTCAA GATGCCCAAC AAGCCTTAAG AAGAACCATG 2520 

GAAATGTTCT CGAGTAACGT TCGCTTTATC TTGAGCTGTA ACTACTCCTC CAAGATAATT 2580 
GAACCCATAC AGTCTAGATG TGCAATATTC CGCTTCAGAC CTCTCCGCGA TGAGGATATA 2640 
GCGAAGAGAC TAAGGTACAT TGCCGAAAAT GAGGGCTTAG AGCTAACTGA AGAAGGTCTC 2700 

10 CAAGCAATAC TTTACATAGC AGAAGGAGAT ATGAGAAGAG CAATAAACAT TCTGCAAGCT 2760 

GCAGCAGCTC TAGACAAGAA GATCACCGAC GAAAACGTAT TCATGGTAGC GAGTAGAGCT 2820 
AGACCTGAAG ATATAAGAGA GATGATGCTT CTTGCTCTCA AAGGCAACTT CTTGAAGGCC 2880 
AGAGAAAAGC TTAGGGAGAT ACTTCTCAAG CAAGGACTTA GTGGAGAAGA TGTACTAGTT 2940 
S CAGATGCACA AAGAAGTCTT CAACCTGCCA ATAGAGGAGC CAAAGAAGGT TCTGCTTGCT 3000 

GATAAGATAG GAGAGTATAA CTTCAGACTC GTTGAAGGGG CTAATGAAAT AATTCAGCTT 3060 
GAAGCACTCT TAGCACAGTT CACCCTAATT GGGAAGAAGT GATGAAGTAT GCCAGAGCTT 3120 
NCCTTGGGTA GAAAAATACA GGCCAAAAAA GCTAAGTGAA ATTGTAAACC AAGAAGAGGC 3180 

20 

TATAGAGAAA GTTAGAGCGT GGATAGAGAG CTGGTTGCAT GNCCACCCCC TTNAGAAAAA 3240 
AGCCGTATTA TTAGCAGGAC CCCCAGGGAG CGGAAAGACA ACCACAGTNT ACGCTNTAGC 3300 
AAATGAGTAC AACTTTGAAG TCATTGAGCT CAACGCGAGT GATGAGAGAA CTTATGAAAA 3360 

25 AATCTCCAGG TATGTTCAAG CAGCATACAC TATGGATATC CTCGGAAAGA GGAGGAAGAT 3420 

AATCTTCCTC GATGAAGCAG ATAATATAGA GCCCAGCGGA GCTAAGGAAA TCGCAAAACT 3480 
AATTGATAAG GCCAAAAATC CAATAATAAT GGCTGCAAAT AAGTACTGGG AAGTTCCAAA 3540 
AGAGATCCGA GAAAAAGCTG AGCTAGTAGA GTACAAGAGG TTAACCCAGA GAGATGTAAT 3600 

30 GAATGCCTTA ATAAGGATCC 3620 

SBQ ID NO: 56 
SEQUENCE LENGTH: 21 
35 SEQUENCE TYPE: nucleic acid 

STRAND EDNESS : single 
TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 

40 

SEQUENCE DESCRIPTION: 

CTTTCCGACA CCAGGGGGGC C 21 

^ SEQ ID NO: 57 

SEQUENCE LENGTH: 21 



so 
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SEQUENCE TYPE: nucleic acid 
STRAND EDNESS : single 
TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

ACTACAGCGG CTTT6GCCCT T 21 

SEQ ID NO: 58 
SEQUENCE LENGTH: 23 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS : single 
TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

GATGAGTTCG TGTCCGTACA ACT 23 

SEQ ID NO: 59 
SEQUENCE LENGTH: 22 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

ACAAAGCCAG CCGGAATATC TG 22 

SEQ ID NO: 60 
SEQUENCE LENGTH: 20 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

GCTTCTAAAT CATTDATNGC 20 
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SEQ ID NO: 61 
SEQUENCE LENGTH: 20 
SEQUENCE TYPE: nucleic acid 
STRAND EDNESS : single 
TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 
GCGTGGATAG AGAGCTGGTT 20 

15 SEQ ID NO: 62 

SEQUENCE LENGTH: 20 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 
so TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 

SEQUENCE DESCRIPTION: 

CTCTGGGTTA ACCTCTTGTA 20 

25 

SEQ ID NO: 63 
SEQUENCE LENGTH: 1437 
SEQUENCE TYPE: nucleic acid 

30 STRANDEDNESS: double 

TOPOLOGY: linear 
MOLECULAR TYPE: Genomic DNA 

^ SEQUENCE DESCRIPTION: 

ATGCCAGAGC TTCCCTGGGT AGAAAAATAC AGGCCAAAAA AGTTAAGTGA AATTGTAAAC 60 
CAAGAAGAGG CTATAGAGAA AGTTAGAGCG TGGATAGAGA GCTGGTTGCA TGGCCACCCC 120 
CCTAAGAAAA AAGCCCTATT ATTAGCAGGA CCCCCAGGGA GCGGAAAGAC AACCACAGTC 180 

40 TACGCTCTAG CAAATGAGTA CAACTTTGAA GTCATTGAGC TCAACGCGAG TGATGAGAGA 240 

ACTTATGAAA AAATCTCCAG GTATGTTCAA GCAGCATACA CTATGGATAT CCTCGGAAAG 300 
AGGAGGAAGA TAATCTTCCT CGATGAAGCA GATAATATAG AGCCCAGCGG AGCTAAGGAA 360 
ATCGCAAAAC TAATTGATAA GGCCAAAAAT CCAATAATAA TGGCTGCAAA TAAGTACTGG 420 

« GAAGTTCCAA AAGAGATCCG AGAAAAAGCT GAGCTAGTAG AGTACAAGAG GTTAACCCAG 480 

AGAGATGTAA TGAATGCCTT AATAAGGATC CTAAAGAGGG AAGGTATAAC AGTTCCAAAA 540 



so 



55 



71 



EP0997 530A1 



10 



15 



20 



35 



40 



45 



GAAATCCTCC TAGAAATAGC AAAAAGATCT AGTGGAGATC TAAGAGCAGC TATAAATGAT 600 
CTACAGACCG TTGTAGTGGG TGGTTACGAA GATGCTACGC AAGTTTTGGC ATATAGAGAT 660 
GTAGAAAAGA CAGTCTTTCA AGCCCTAGGA CTCGTCTTTG GAAGTGACAA CGCCAAGAGG 720 
GCAAAGATGG CAATGTGGAA CTTGGACATG TCCCCTGATG AATTCCTGCT ATGGGTAGAT 780 
GAGAACATTC CTCACCTCTA CCTAAATCCA GAGGAGATTG CCCAGGCGTA TGATGCAATT 840 
AGTAGAGCCG ACATATACCT CGGAAGGGCC GCCAGAACTG GAAACTATTC ACTCTGGAAG 900 
TACGCAATAG ATATGATGAC TGCAGGAGTT GCCGTGGCAG GGAGAAAGAG AAGGGGATTT 960 . 
GTCAAGTTTT ATCCTCCCAA CACCCTAAAG ATTTTAGCGG AAAGCAAAGA AGAAAGAGAG 1020 
ATCAGAGAGT CAATAATTAA AAAGATAATA CGAGAGATGC ACATGAGTAG GCTACAGGCA 1080 
ATAGAAACGA TGAAAATAAT TAGAGAGATT TTCGAGAACA ATCTAGACCT TGCTGCGCAC 1140 
TTTACAGTGT TCCTTGGTCT GTCTGAAAAA GAAGTTGAGT TTCTAGCTGG AAAGGAAAAA 1200 
GCTGGTACCA TTTGGGGCAA AGCCTTAGCA TTAAGAAGGA AACTTAAGGA GCTTGGAATA 1260 
AGAGAGGAGG AGAAGCCTAA AGTTGAAATT GAAGAAGAGG AAGAAGAGGA AGAAAAGACC 1320 
GAAGAAGAAA AAGAGGAAAT AGAAGAAAAA CCCGAAGAAG AGAAAGAAGA GGAGAAGAAA 1380 
GAAAAGGAAA AGCCAAAGAA AGGCAAACAA GCAACTCTCT TTGACTTTCT TAAAAAG 1437 



SEQ ID NO: 64 
25 SEQUENCE LENGTH: 479 

SEQUENCE TYPE: amino acid 

STRANDEDNESS: single 

TOPOLOGY: linear 
30 MOLECULAR TYPE: peptide 

SEQUENCE DESCRIPTION: 

Met Pro Glu Leu Pro Trp Val Glu Lys Tyr Arg Pro Lys Lys Leu 
5 10 15 

Ser Glu He Val Asn Gin Glu Glu Ala He Glu Lys Val Arg Ala 
20 25 30 

Trp He Glu Ser Trp Leu His Gly His Pro Pro Lys Lys Lys Ala 
35 40 45 

Leu Leu Leu Ala Gly Pro Pro Gly Ser Gly Lys Thr Thr Thr Val 
50 55 60 

Tyr Ala Leu Ala Asn Glu Tyr Asn Phe Glu Val He Glu Leu Asn 
65 70 75 

Ala Ser Asp Glu Arg Thr Tyr Glu Lys He Ser Arg Tyr Val Gin 
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80 85 90 

Ala Ala Tyr Thr Met Asp lie Leu Gly Lys Arg Arg Lys lie He 

95 100 105 

Phe Leu Asp Glu Ala Asp Asn lie Glu Pro Ser Gly Ala Lys Glu 

110 115 120 

lie Ala Lys Leu lie Asp Lys Ala Lys Asn Pro He He Met Ala 

125 130 135 

Ala Asn Lys Tyr Trp Glu Val Pro Lys Glu He Arg Glu Lys Ala 

140 145 150 

Glu Leu Val Glu Tyr Lys Arg Leu Thr Gin Arg Asp Val Met Asn 

155 160 165 

Ala Leu lie Arg lie Leu Lys Arg Glu Gly lie Thr Val Pro Lys 

170 175 180 

Glu He Leu Leu Glu He Ala Lys Arg Ser Ser Gly Asp Leu Arg 

185 190 195 

Ala Ala He Asn Asp Leu Gin Thr Val Val Val Gly Gly Tyr Glu 

200 205 210 

Asp Ala Thr Gin Val Leu Ala Tyr Arg Asp Val Glu Lys Thr Val 

215 220 225 

Phe Gin Ala Leu Gly Leu Val Phe Gly Ser Asp Asn Ala Lys Arg 

230 235 240 

Ala Lys Met Ala Met Trp Asn Leu Asp Met Ser Pro Asp Glu Phe 

245 250 255 

Leu Leu Trp Val Asp Glu Asn He Pro His Leu Tyr Leu Asn Pro 

260 265 270 

Glu Glu He Ala Gin Ala Tyr Asp Ala He Ser Arg Ala Asp lie 

275 280 285 

Tyr Leu Gly Arg Ala Ala Arg Thr Gly Asn Tyr Ser Leu Trp Lys 

290 295 300 

Tyr Ala He Asp Met Met Thr Ala Gly Val Ala Val Ala Gly Arg 

305 310 315 

Lys Arg Arg Gly Phe Val Lys Phe Tyr Pro Pro Asn Thr Leu Lys 

320 325 330 

He Leu Ala Glu Ser Lys Glu Glu Arg Glu He Arg Glu Ser He 
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335 340 345 

He Lys Lys He He Arg Glu Met His Met Ser Arg Leu Gin Ala 

350 355 360 

He Glu Thr Met Lys He He Arg Glu He Phe Glu Asn Asn Leu 

365 370 375 

Asp Leu Ala Ala His Phe Thr Val Phe Leu Gly Leu Ser Glu Lys 

380 385 390 

Glu Val Glu Phe Leu Ala Gly Lys Glu Lys Ala Gly Thr He Tip 

395 400 405 

Gly Lys Ala Leu Ala Leu Arg Arg Lys Leu Lys Glu Leu Gly He 

410 415 420 

Arg Glu Glu Glu Lys Pro Lys Val Glu He Glu Glu Glu Glu Glu 

425 430 435 

Glu Glu Glu Lys Thr Glu Glu Glu Lys Glu Glu He Glu Glu Lys 

440 445 450 

Pro Glu Glu Glu Lys Glu Glu Glu Lys Lys Glu Lys Glu Lys Pro 

455 460 465 

Lys Lys Gly Lys Gin Ala Thr Leu Phe Asp Phe Leu Lys Lys 

470 475 

SEQ ID NO: 65 
SEQUENCE LENGTH: 23 
SEQUENCE TYPE: nucleic acid 
STRANDED NESS : Single 
TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

ATGGATATWG ARGTDYTNAG RAG 23 



SEQ ID NO: 66 
SEQUENCE LENGTH: 20 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: Single 
TOPOLOGY: linear 



74 



EP 0 997 530 A1 



10 



15 



MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

ATWGARGTWY TWAGRAGRYT 20 

SEQ ID NO: 67 
SEQUENCE LENGTH : 20 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

GAGAGAGAAC TTTCAAGCGA 20 

20 SEQ ID NO: 68 

SEQUENCE LENGTH: 20 

SEQUENCE TYPE: nucleic add 

STRANDEDNESS: single 
25 TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 

SEQUENCE DESCRIPTION : 

CTCTAAGAAG ATATGCCTCT 20 

30 

SEQ ID NO: 69 

SEQUENCE LENGTH: 558 
35 SEQUENCE TYPE: nucleic add 

STRANDEDNESS: double 

TOPOLOGY: linear 

MOLECULAR TYPE: Genomic DNA 
40 SEQUENCE DESCRIPTION: 

ATGGATATTG AGGTTCTCAG AAGATTATTG GAGAGAGAAC TTTCAAGCGA AGAACTGACT 60 

AAAATAGAGG AAGAATTTTA TGACGATTTA GAAAGCTTTA GAAAAGCCTT GGAAATCAAT 120 

GCCGAGAGAC ATGAAGAAAG AGGAGAGGAC ATTCACAAAA AGCTGTATTT AGCTCAACTA 180 
45 TCTTTGGTTA GGAATCTTGT TAGAGAAATA TTAAGGATTA GGTTGCATAA GATTGTTGAT 240 

ATGGCATTTG AGGGAGTTCC CAGAAATTTA GTTGGAGATG AAAAGAAAAT ATACAAGATA 300 
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20 



ATAACAGCTT TCATAAATGG AGAACCTCTT GAAATTGAAA CGGCAGGAGA AGAGAGTATT 360 
GAAGTTATTG AAGAGGAAAA AGAAACATCT CCTGGGATAA TAGAGGCATA TCTTCTTAGA 420 
5 GTTGATATTC CCAAAATATT GGATGAAAAT TTGAGAGAAT ATGGGOCCTT CAAGGCTGGC 480 

GATCTTGTTG TATTGCCGAA GTCTATTGGC AGGGTACTCA TTCAGAGGGA TGCCGCGGAT 540 
AAGGTATTGA TACAATTG 558 

10 SEQ ID NO: 70 

SEQUENCE LENGTH: 186 
SEQUENCE TYPE: amino acid 
STRAND EDNESS : single 
TOPOLOGY: linear 
MOLECULAR TYPE: peptide 
SEQUENCE DESCRIPTION: 

Met Asp lie Glu Val Leu Arg Arg Leu Leu Glu Arg Glu Leu Ser 
5 10 15 

Ser Glu Glu Leu Thr Lys lie Glu Glu Glu Phe Tyr Asp Asp Leu 
20 25 30 

Glu Ser Phe Arg Lys Ala Leu Glu lie Asn Ala Glu Arg His Glu 
35 40 45 

Glu Arg Gly Glu Asp lie His Lys Lys Leu Tyr Leu Ala Gin Leu 
50 55 60 

Ser Leu Val Arg Asn Leu Val Arg Glu He Leu Arg He Arg Leu 
65 70 75 

His Lys He Val Asp Met Ala Phe Glu Gly Val Pro Arg Asn Leu 
80 85 90 

Val Gly Asp Glu Lys Lys He Tyr Lys He He Thr Ala Phe He 
95 100 105 

Asn Gly Glu Pro Leu Glu He Glu Thr Ala Gly Glu Glu Ser He 
110 115 120 

Glu Val He Glu Glu Glu Lys Glu Thr Ser Pro Gly He He Glu 
125 130 135 

Ala Tyr Leu Leu Arg Val Asp He Pro Lys He Leu Asp Glu Asn 
«* 140 145 150 

Leu Arg Glu Tyr Gly Pro Phe Lys Ala Gly Asp Leu Val Val Leu 
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155 160 165 

Pro Lys Ser lie Gly Arg Val Leu lie Gin Arg Asp Ala Ala Asp 

170 175 180 

Lys Val Leu lie Gin Leu 

185 

SEQ ID NO: 71 
SEQUENCE LENGTH: 33 
SEQUENCE TYPE: nucleic acid 
STRAND EDNESS : single 
TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

TTTAATTTGG GGATAACCAT GGATATTGAG GTT 33 

SEQ ID NO: 72 
SEQUENCE LENGTH: 31 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

TAGGATGGGT TTTGGATCCT CTCATTGGAG G 31 

SEQ ID NO: 73 
SEQUENCE LENGTH: 20 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

ATGATWGGWW SWATHTTYTA 20 



SEQ ID NO: 74 
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SEQUENCE LENGTH: 23 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

AAGAAGTTTA ATYTDCAYAG RCC 23 

SEQ ID NO: 75 
SEQUENCE LENGTH: 20 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS : single 
TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

TGAGTATCAT CCAGAGAATC 20 

SEQ ID NO: 76 
SEQUENCE LENGTH: 20 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

TCACATCGGG ATCGTTCCAG 20 

SEQ ID NO: 77 
SEQUENCE LENGTH: 20 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

GATTTTGACG CTCATCATGG 20 



78 



EP0 997 530A1 



SEQ ID NO: 78 
SEQUENCE LENGTH: 20 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

G6AAA6AAC6 ATTTCGAGTC 20 

SEQ ID NO: 79 
SEQUENCE LENGTH: 1005 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: double 
TOPOLOGY: linear 
MOLECULAR TYPE: Genomic DNA 
SEQUENCE DESCRIPTION: 

ATGATTGGCT CAATATTTTA TTCCAAGAAG TTTAACCTCC ATAGACCTAG TGAGTATCAT 60 
CCAGAGAATC CCAAGAGACT CGAAATCGTT CTTTCCAAGG TCAGAGAGCT TGGACTTGAA 120 
GAAAGAATAG AAGAACCAAA CCCAGTTGAA GAGACTTTCG TTGAGAAAAT TCACGACAGG 180 
GATTACATCA ACTTCGTTAA AGAGGCCGTT GAAAAAGGAA TCACAAGACT TGATCCAGAC 240 
ACTTATGTTT CTCCTGGGAC TTGGAGTGCG GCATTGTTAG CTTTAGGAGC CGCAAGGAGT 300 
GCAGCTTTAT CAGCCCTTCA CTATGGAGGC CTCCACATGG CTCTAGTTAG GCCCCCTGGG 360 
CATCATGCAG GGAGAAGAGG AAGGGCCATG GGTGCCCCAA CACTAGGCTT CTGCATCTTC 420 
AACAACGCGG CCTCTGCAGT TGTCACCTTG AAAGAAGAGG GAGTTGGAAA AGTTGTTGTA 480 
ATAGATTTTG ACGCTCATCA TGGAAACGGG ACTCAGGAAA TATTCTGGAA CGATCCCGAT 540 
GTGATTCACA TAGATCTACA CGAGAGAGAC ATCTACCCAG GGAGTGGGGA TGTGAGTGAA 600 
GTTGGAGGGT CAAATGCTTA TGGGAGCAAG ATAAACCTCC CAATGCCCCA CTATTCTGGG 660 
GATGGGGATT ACATATATGT TTGGGACGAA ATTGTGCTTC CAATAGTTGA AGAAGTTAAG 720 
CCAAAGGTCA TCGTAATTTC CGCGGGCTTT GATGGATTTA AAGGGGATGG TCTAACAACA 780 
TTAAGGCTCA CAGAAAGTTT TTACTCTTAT GCAGGGGCTA CATTAAATAA ATATCCCTTG 840 
GCATTTATAT TGGAAGGCGG GTATGGAGTA GGGTTAGATA AAGGTTTTCC GGCCTTCATA 900 
ATGGGCTACG AAGAGGGTAA AGCGAAAGCT CGAGAAGAGC CAAGATATGA GACCCTAAAG 960 
TTGGCGGAGG AGGTTAAGGA CATCTTGAGT CCCTGGTGGT CGTTA 1005 
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SEQ ID NO: 80 
SEQUENCE LENGTH: 335 
SEQUENCE TYPE: amino acid 
STRANDEDNBSS : S ingle 
TOPOLOGY: linear 
MOLECULAR TYPE: peptide 
SEQUENCE DESCRIPTION: 

Met He Gly Ser He Phe Tyr Ser Lys Lys Phe Asn Leu His Arg 

5 10 15 

Pro Ser Glu Tyr His Pro Glu Asn Pro Lys Arg Leu Glu He Val 
20 25 30 

Leu Ser Lys Val Arg Glu Leu Gly Leu Glu Glu Arg He Glu Glu 
35 40 45 

Pro Asn Pro Val Glu Glu Thr Phe Val Glu Lys He His Asp Arg 
50 55 60 

Asp Tyr He Asn Phe Val Lys Glu Ala Val Glu Lys Gly He Thr 
65 70 75 

Arg Leu Asp Pro Asp Thr Tyr Val Ser Pro Gly Thr Trp Ser Ala 
80 85 90 

Ala Leu Leu Ala Leu Gly Ala Ala Arg Ser Ala Ala Leu Ser Ala 
95 100 105 

Leu His Tyr Gly Gly Leu His Met Ala Leu Val Arg Pro Pro Gly 
HO 115 120 

His His Ala Gly Arg Arg Gly Arg Ala Met Gly Ala Pro Thr Leu 
125 130 135 

Gly Phe Cys He Phe Asn Asn Ala Ala Ser Ala Val Val Thr Leu 
140 145 150 

Lys Glu Glu Gly Val Gly Lys Val Val Val He Asp Phe Asp Ala 
155 160 165 

His His Gly Asn Gly Thr Gin Glu He Phe Trp Asn Asp Pro Asp 
170 175 180 

Val He His He Asp Leu His Glu Arg Asp He Tyr Pro Gly Ser 
185 190 195 

Gly Asp Val Ser Glu Val Gly Gly Ser Asn Ala Tyr Gly Ser Lys 
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200 

lie Asn Leu Pro Met Pro His Tyr 
215 

Tyr Val Trp Asp Glu He Val Leu 
230 

Pro Lys Val He Val He Ser Ala 
245 

Asp Gly Leu Thr Thr Leu Arg Leu 
260 

Ala Gly Ala Thr Leu Asn Lys Tyr 
275 

Gly Gly Tyr Gly Val Gly Leu Asp 
290 

Met Gly Tyr Glu Glu Gly Lys Ala 
305 

Tyr Glu Thr Leu Lys Leu Ala Glu 
320 

Pro Trp Trp Ser Leu 

335 

SEQ ID NO: 81 
SEQUENCE LENGTH: 36 
SEQUENCE TYPE: nucleic acid 
STRAND ED NESS : single 
TOPOLOGY: linear 
MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

GGGAAGAAGT GATGACATAT GCCAGAGCTT CCCTGG 36 



SEQ ID NO: 82 
SEQUENCE LENGTH: 20 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 



205 210 
Ser Gly Asp Gly Asp Tyr He 
220 225 
Pro He Val Glu Glu Val Lys 
235 240 
Gly Phe Asp Gly Phe Lys Gly 
250 255 
Thr Glu Ser Phe Tyr Ser Tyr 
265 270 
Pro Leu Ala Phe He Leu Glu 
280 285 
Lys Gly Phe Pro Ala Phe He 
295 300 
Lys Ala Arg Glu Glu Pro Arg 
310 315 
Glu Val Lys Asp He Leu Ser 
325 330 
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MOLECULAR TYPE: other nucleic acid (synthetic DMA) 
SEQUENCE DESCRIPTION: 
5 TTCCAAGCTC CTTAAGTTTC 20 

SEQ ID NO: 83 
SEQUENCE LENGTH: 3574 
10 SEQUENCE TYPE: nucleic acid 

STRAND EDNESS : double 
TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 

15 

SEQUENCE DESCRIPTION: 

CATATGCCAG AGCTTCCCTG GGTAGAAAAA TACAGGCCAA AAAAGTTAAG TGAAATTGTA 60 
AACCAAGAAG AGGCTATAGA GAAAGTTAGA GCGTGGATAG AGAGCTGGTT GCATGGCCAC 120 
„ CCCCCTAAGA AAAAAGCCCT ATTATTAGCA GGACCCCCAG GGAGCGGAAA GACAACCACA 180 

20 

GTCTACGCTC TAGCAAATGA GTACAACTTT GAAGTCATTG AGCTCAACGC GAGTGATGAG 240 
AGAACTTATG AAAAAATCTC CAGGTATGTT CAAGCAGCAT ACACTATGGA TATCCTCGGA 300 
AAGAGGAGGA AGATAATCTT CCTCGATGAA GCAGATAATA TAGAGCCCAG CGGAGCTAAG 360 

25 GAAATCGCAA AACTAATTGA TAAGGCCAAA AATCCAATAA TAATGGCTGC AAATAAGTAC 420 

TGGGAAGTTC CAAAAGAGAT CCGAGAAAAA GCTGAGCTAG TAGAGTACAA GAGGTTAACC 480 
CAGAGAGATG TAATGAATGC CTTAATAAGG ATCCTAAAGA GGGAAGGTAT AACAGTTCCA 540 
AAAGAAATCC TCCTAGAAAT AGCAAAAAGA TCTAGTGGAG ATCTAAGAGC AGCTATAAAT 600 

30 GATCTACAGA CCGTTGTAGT GGGTGGTTAC GAAGATGCTA CGCAAGTTTT GGCATATAGA 660 

GATGTAGAAA AGACAGTCTT TCAAGCCCTA GGACTCGTCT TTGGAAGTGA CAACGCCAAG 720 
AGGGCAAAGA TGGCAATGTG GAACTTGGAC ATGTCCCCTG ATGAATTCCT GCTATGGGTA 780 
GATGAGAACA TTCCTCACCT CTACCTAAAT CCAGAGGAGA TTGCCCAGGC GTATGATGCA 840 

35 ATTAGTAGAG CCGACATATA CCTCGGAAGG GCCGCCAGAA CTGGAAACTA TTCACTCTGG 900 

AAGTACGCAA TAGATATGAT GACTGCAGGA GTTGCCGTGG CAGGGAGAAA GAGAAGGGGA 960 
TTTGTCAAGT TTTATCCTCC CAACACCCTA AAGATTTTAG CGGAAAGCAA AGAAGAAAGA 1O20 
GAGATCAGAG AGTCAATAAT TAAAAAGATA ATACGAGAGA TGCACATGAG TAGGCTACAG 1080 

40 GCAATAGAAA CGATGAAAAT AATTAGAGAG ATTTTCGAGA ACAATCTAGA CCTTGCTGCG 1140 

CACTTTACAG TGTTCCTTGG TCTGTCTGAA AAAGAAGTTG AGTTTCTAGC TGGAAAGGAA 1200 
AAAGCTGGTA CCATTTGGGG CAAAGCCTTA GCATTAAGAA GGAAACTTAA GGAGCTTGGA 1260 

^ ATAAGAGAGG AGGAGAAGCC TAAAGTTGAA ATTGAAGAAG AGGAAGAAGA GGAAGAAAAG 1320 

ACCGAAGAAG AAAAAGAGGA AATAGAAGAA AAACCCGAAG AAGAGAAAGA AGAGGAGAAG 1380 
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AAAGAAAAGG AAAAGCCAAA GAAAGGCAAA CAAGCAACTC TCTTTGACTT TCTTAAAAAG 1440 
TGATTACCCT TTTTCTTCTA TTAGAGCTCC GAATAAAGTT GGCCCTCTAA TTTTTTCTAT 1500 
TGTCTCCTCC ACATTAATCT TTACGAATTC GAGCTCCAGC AACAACAATA ACCCAAGATG 1560 
GAAAGGACTT TGGAGTAAGG TACTTTGGAT TACCGGCAGG TCATGAGTTC GCAGCATTCT 1620 
TAGAGGACAT TGTGGATGTT AGTAGAGAAG AAACAAACCT TATGGACGAG ACAAAACAGG 1660 
CCATCAGAAA CATAGACCAG GATGTAAGAA TATTGGTGTT TGAAACTCCA ACATGCCCAT 1740 
ACTGTCCACT TGCCGTTAGA ATGGCTCACA AGTTTGCCAT TGAAAACACA AAAGCTGGGA 1800 
AAGGTAAGAT ACTTGGGGAT ATGGTCGAGG CCATTGAGTA TCCAGAGTGG GCTGACCAGT 1860 
ACAATGTAAT GGCAGTACCA AAAATTGTTA TTCAGGTCAA CGGAGAAGAC AGAGT AGAAT 1920 
TTGAAGGAGC TTATCCAGAG AAAATGTTCT TAGAGAAGTT ACTCTCAGCT CTCAGCTGAT 1980 
CTACTGTTTT TCCTTCTTTT CTTCTGTTCT GTTATTGCCT AGGATAAGCT TAATAATACT 2040 
TTGATACCTT TCTTAGTTTA GGTGTGTGAG AGTATGAGCG AAGAGATTAG AGAAGTTAAG 2100 
GTTCTAGAAA AACCCTGGGT TGAGAAGTAT AGACCTCAAA GACTTGACGA CATTGTAGGA 2160 
CAAGAGCACA TAGTGAAAAG GCTGAAGCAC TACGTCAAAA CTGGATCAAT GCCCCACCTA 2220 
CTCTTCGCAG GCCCCCCTGG TGTCGGAAAG ACTACAGCGG CTTTGGCCCT TGCAAGAGAG 2280 
CTTTTCGGCG AAAACTGGAG GCATAACTTC CTCGAGTTGA ATGCTTCAGA TGAAAGAGGT 2340 
ATAAACGTAA TTAGAGAGAA AGTTAAGGAG TTTGCGAGAA CAAAGCCTAT AGGAGGAGCA 2400 
AGCTTCAAGA TAATTTTCCT TGATGAGGCC GACGCTTTAA CTCAAGATGC CCAACAAGCC 2460 
TTAAGAAGAA CCATGGAAAT GTTCTCGAGT AACGTTCGCT TTATCTTGAG CTGTAACTAC 2520 
TCCTCCAAGA TAATTGAACC CATACAGTCT AGATGTGCAA TATTCCGCTT CAGACCTCTC 2580 
CGCGATGAGG ATATAGOGAA GAGACTAAGG TACATTGCCG AAAATGAGGG CTTAGAGCTA 2640 
ACTGAAGAAG GTCTCCAAGC AATACTTTAC ATAGCAGAAG GAGATATGAG AAGAGCAATA 2700 
AACATTCTGC AAGCTGCAGC AGCTCTAGAC AAGAAGATCA CCGACGAAAA CGTATTCATG 2760 
GTAGCGAGTA GAGCTAGACC TGAAGATATA AGAGAGATGA TGCTTCTTGC TCTCAAAGGC 2820 
AACTTCTTGA AGGCCAfcAGA AAAGCTTAGG GAGATACTTC TCAAGCAAGG ACTTAGTGGA 2880 
GAAGATGTAC TAGTTCAGAT GCACAAAGAA GTCTTCAACC TGCCAATAGA GGAGCCAAAG 2940 
AAGGTTCTGC TTGCTGATAA GATAGGAGAG TATAACTTCA GACTCGTTGA AGGGGCTAAT 3000 
GAAATAATTC AGCTTGAAGC ACTCTTAGCA CAGTTCACCC TAATTGGGAA GAAGTGATGA 3060 
AGTATGCCAG AGCTTCOCTG GGTAGAAAAA TACAGGCCAA AAAAGTTAAG TGAAATTGTA 3120 
AACCAAGAAG AGGCTATAGA GAAAGTTAGA GCGTGGATAG AGAGCTGGTT GCATGGCCAC 3180 
CCCCCTAAGA AAAAAGCOGT ATTATTAGCA GGACCCCCAG GGAGCGGAAA GACAACCACA 3240 
GTCTACGCTC TAGCAAATGA GTACAACTTT GAAGTCATTG AGCTCAACGC GAGTGATGAG 3300 
AGAACTTATG AAAAAATCTC CAGGTATGTT CAAGCAGCAT ACACTATGGA TATCCTCGGA 3360 
AAGAGGAGGA AGATAATCTT CCTCGATGAA GCAGATAATA TAGAGCCCAG CGGAGCTAAG 3420 
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GAAATCGCAA AACTAATTGA TAAGGCCAAA AATCCAATAA TAATGGCTGC AAATAAGTAC 3480 
TGGGAAGTTC CAAAAGAGAT CCGAGAAAAA GCTGAGCTAG TAGAGTACAA GAGGTTAACC 3540 
CAGAGAGATG TAATGAATGC CTTAATAAGG ATCC 3574 

SEQ ID NO: 84 
SEQUENCE LENGTH: 33 
SEQUENCE TYPE: nucleic acid 
STRAND EDNESS : single 
TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

TACTTGTAAT ATTCTCATAT GATTGGCTCA ATA 33 

SEQ ID NO: 85 
SEQUENCE LENGTH: 35 
SEQUENCE TYPE: nucleic acid 
STRAND EDNESS : single 
TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

GATGAGTTCG TGTCCGTACA ACTGGCGTAA TCATG 35 

SEQ ID NO: 86 
SEQUENCE LENGTH: 25 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

GGTTATCGAA ATCAGCCACA GCGCC 25 

SEQ ID NO: 87 
SEQUENCE LENGTH: 23 
SEQUENCE TYPE: nucleic acid 
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STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION; 

GCGTACCTTT GTCTCACGGG CAA 23 

SEQ ID NO: 88 
SEQUENCE LENGTH: 22 
SEQUENCE TYPE: nucleic acid 
strandedness: single 
TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

GATAGCTGTC GTCATAGGAC TC 22 

SEQ ID NO: 89 
SEQUENCE LENGTH: 23 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

CTTAACCAGT GCGCTGAGTG ACT 23 

SEQ ID NO: 90 
SEQUENCE LENGTH: 2B 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

GACAATCTGG AATACGCCAC CTGACTTG 28 
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SEQUENCE LENGTH: 28 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

TTGCCACTTC CGTCAACCAG GCTTATCA 28 

SEQ ID NO: 92 
SEQUENCE LENGTH: 29 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

TGTCCGTCAG CTCATAACGG TACTTCACG 29 



Claims 

1. A thermostable DNA polymerase-associated factor capable of enhancing DNA synthesizing-activity of a DNA 
polymerase. 

2. The DNA polymerase-associated factor according to claim 1, further possessing an activity of binding to a DNA 
polymerase. 

3. The DNA polymerase-associated factor according to claim 2, which possesses an activity of binding to a DNA 
polymerase comprising a DNA pdymerase-constituting protein having the amino acid sequence as shown in SEQ 
ID NO: 5 or 6 in Sequence Listing. 

4. The DNA polymerase-associated factor according to any one of claims 1 to 3, comprising at least one of amino acid 
sequences selected from the group consisting of SEQ ID NOs: 1 , 3, 19, 27, 34, 64, 70 and 80 in Sequence Listing, 
or an amino acid sequence resulting from substitution, deletion, addition or insertion of one or more amino acids in 
at least one of said amino acid sequences. 

5. A gene encoding a DNA polymerase-associated factor, wherein the factor comprises at least one of amino acid 
sequences selected from the group consisting of SEQ ID NOs: 1 . 3, 19, 27, 34, 64, 70 and 80 in Sequence Listing, 
or an amino acid sequence resulting from substitution, deletion, addition or insertion of one or more amino acids in 
at least one of said amino acid sequences, and possesses an activity of enhancing DNA synthesizing-activity of a 
DNA polymerase. 

6. The gene according to claim 5, comprising a nucleotide sequence selected from the group consisting of SEQ ID 
NOs: 2, 4, 18, 26, 33, 63, 69 and 79, or a nucleotide sequence resulting from substitution, deletion, addition or 
insertion of one or more bases in said nucleotide sequence. 
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7. A gene capable of hybridizing to the gene of claim 5 or 6, and encoding a DNA polymerase-associated factor pos- 
sessing an activity of enhancing DNA synthesizing-activity of a DNA polymerase. 

8. A method for producing a DNA polymerase-associated factor, characterized in that the method comprises culturing 
a transfbrmant harboring the gene of any one of claims 5 to 7, and collecting a thermostable DNA polymerase- 
associated factor capable of enhancing DNA synthesizing-activity of a DNA polymerase from the cultured medium. 

9. A method of DNA synthesis by using a DNA polymerase, characterized in that DNA is synthesized in the presence 
of the DNA polymerase-associated factor of any one of claims 1 to 4. 

10. The method of DNA synthesis according to claim 9, wherein DNA is synthesized in the presence of two or more 
kinds of DNA polymerase-associated factors. 

1 1 . The method of DNA synthesis according to claim 1 0, wherein DNA is synthesized in the presence of F7, PFU-RFC 
and PFU-RFCLS as a DNA polymerase-associated factor. 

12. The method of DNA synthesis according to any one of claims 9 to 1 1 , wherein said DNA polymerase is a ther- 
mostable DNA polymerase. 

13. The method of DNA synthesis according to claim 12, wherein the synthesis is carried out by PCR method. 

14. A kit usable for in vitro DNA synthesis, comprising the DNA polymerase-associated factor of any one of claims 1 to 
4 and a DNA polymerase. 

15. The kit according to claim 14, further comprising a reagent required for DNA synthesis. 

16. The kit according to claim 14 or 15. comprising two or more kinds of DNA polymerase-associated factors. 

17. The kit according to daim 16, comprising F7, PFU-RFC and PFU-RFCLS as a DNA polymerase-associated factor. 

18. The kit according to any one of claims 14 to 1 7, comprising a thermostable DNA polymerase as a DNA polymerase. 



87 



EPO 997530 A1 



Molecular 
Weight 

Marker 

K J- PfuPolymeraseC 

60 

45 



30 



F6 
Fl 



F3 
F7 

F5 



F4 

14 



FIG. 1 



EP 0 997 530 A1 



X 1 



.5 o 
K 2 



8 la 
w m £ 

w m a. 



w 
a. 



2 



O. 

8 



FIG. 



89 



EP 0 997 530 A1 




FIG. 3 



90 



EP0997530A1 




EP 0 997 530 A1 



— X 



O) 
.CD 

CO 
X 



CO 

in 
• • 

Q> 



"D 
C 



— CD 



CO 

• • 

O) 
00 



E 

TO 
CO 

■ • 

CO 



FIG. 5 



92 



EP0997 530A1 



X _ 



a 



li 



f 

o 



X J 



FIG. 6 



93 



EP0997530A1 




EP 0 997 530 A1 



Pfu-C 
M - + 



pfu 
- + M 




FIG. 8 



95 



EP0997 530A1 

CO 



to— 



O 
2- 



to. 



to — 



2 



— X 

— X 



— X 



CO 

S 

A: 
o 

CO 

• ■ 

o 
u 
z 

» • 

u 
2 



CD 



2 
X 

E 

03 
CO 

• • 

CO 



FIG. 9 

96 



» » 



EP0997 530A1 



Molecular 
Weight 

Marker 

97K — PfuDNA Polymerase 

60 

45 




21 — 
14 

FIG. 10 



97 



r • 



EP 0 997 530 A1 



a 



>> 

> 



o 
-< 

o 

vi 
CO 

u 
o 

E 



4000 



3000 - 



2000 - 



1000 - 



PfuDNA Polymerase 
PfuDNA Polymerase + F7 




Fraction Numbers 



FIG. 11 



98 



EP0997530A1 



Nh B K Sp Nh 
L_ ,,11 L_l 



HHHHH H H H 
► ► 

PFU-RFC PRJ-RFCLS 
BiBamHI, HtHindlll, Nh:Nhel, K:Kpnl, Sp:Sphl 1 Kb 



FIG. 12 



99 



EPO 997530 A1 



H B N H N P 

L_J I I U 

► ► 

F7 F5 

H: Hlndlll, B: BamHI, N: Ndel, P: Pstl ~7T 

0.2 KD 



FIG. 13 



100 



EPO 997530 A1 



Mol ecular 
Weight 
Marker 

97K — 

60 = PFU-RFCLS 



45 



30 



PFU-RFC 
F7 




FIG. 14 



101 



EP0997 530 A1 



10000 




F7(fmol) 

-• — Pfu DNA "Polymerase +F7 
m — Pfu DNA Polymerase + RFC-N comp 1 ex 
— RFC-N comp lex 



FIG. 15 



102 



EP0 997 



530A1 



GO 



LU 



& 



c 
o 

c 



o 
t« 

W 
OS 

o 
O 

LL 

a 



LU • 
CD' 



ID 
u. 

CL 



3 

a 

Li- 
DC 
i 

3 



FIG. 16 



103 



EPO 997530 A1 



10000 




F7(ftnol) 



-• — Pfu DNA Polymerase +F7 

-* — Pfu DNA Polymerase + F7+rRFC-M complex 



FIG. 17 



104 



EP0 997 530A1 



INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/JP98/02845 



A. CLASSIFICATION OF SUBJECT MATTER 

int. CI 1 C12N15/54, C12N9/12, C12N15/31, C07K14/195, C12P21/02 // 

(C12N15/54, C12R1:01) (C12N9/12, C12Rltl9) (C12N15/31, C12R1:01) 
According to International Patent Classification (IPQ or to both national classification and IPC 



B. FIELDS SEARCHED 



Minimum documentation searched (classification system followed by classification symbols) 

Int. CI* C12N15/54, C12N9/12, C12N15/31, C07K14/195, C12P21/02 



Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched 



Electronic data base consulted during the international search (name of data base and. where practicable, search terms used) 
Genbank/EMBL/DDBJ/GeneSeq, WPI (DIALOG), BIOSIS (DIALOG) 



C DOCUMENTS CONSIDERED TO BE RELEVANT 



Category 4 



Citation of document, with indication, where appropriate, of the relevant passages 



Relevant to claim No. 



PX 
/PA 



JP, 10-84954, A (The Institute of Physical and 

Chenical Research), 

7 April, 1998 (07. 04. 98) 

4V BP, 821058, A2 

The Journal of Japanese Biochemical Society Vol. 68, 
No. 9 (1996) Hirosi Morioka "Structure and function 
of proliferating cell nuclear antigen (PCNA)" 
p. 1542-1548 



1, 9-10, 
12-16, 18 
/2-8, 11, 17 



1-18 



| | Further documents are listed in the continuation of Box C Q See patent family i 



* SpcchU categories of died docameatt 

"A" document defining the general attle of IbewtwhkSvbaot 

coessdtred to be of particular idevuce 
"F cutter document but published oa or ifter the interns rionsj filing due 
"L" docnmcoi waicfc amy throw doubts oo prioniy d»im(«) or winch is 

died to establish the pubticatba date of another dtstioa or other 

speokl maM(n speriued) 

•o- 



T bleidocemeejpuMisbadaflertte 

datea«dfrthicoriflklwitbiaesppac«tio«b^c^toi 

"X" document of pafiicutar relevances i 
«be< 

wbenAcdocomlbtrifi 
*Y" dtKtttnemdpMtkaSjarrdewcgte 

coondered to invohw an uweativo step wfcrafedocasottttai 




"P* documeat published prior to tae internatUmsi filing ds* bat later than 
tae pnonty dale daimed 



being obvious to a person skilled la the an 
"ft" docarnetUmcn^rfg»s»mepelenie«nBJy 



Date of the actual completion of the international search 
25 September, 1998 (25. 09. 98) 



Date of mailing of the international search report 

6 October, 1998 (06. 10. 98) 



Name and mailing address of the ISA/ 
Japanese Patent Office 

Facsimile No. 



Authorized officer 
Telephone Na 



fimn PCT/JSA/210 (second sheet) (July 1992) 



105 



EP0 997 530 A1 



INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/JP98/02845 



A. f Continuation! CLASS TFTCATION OP SUBJECT MftTTBR 
(C12P21/02, C12R1:19) 



Form PCT/T^A/210 (extra sheet) (July 1992) 



106 



